
DTT^fAlTONAL^ rTT^;;;;;;;;^ Number: wo 



AOIK 67/027, C12N9a6 

t- , „ PCT/CA0(V0O43O 

(21) International AppUa.ti««. Number. ^ 

\ „. 20 April 2000 (20X)4.00) 

(22) International filing Date. 



, (U) International Publicatton Number: 
Al (,3) inUrnatU.nalPubB«^^ 



23 April 1999 (23X)4.99) 



US 



(30) Priority DaU: 
' 60/130^08 

(72) Inventors; and ... poRSBERG, Cecil, W. 

nS)lnventors/Applicants(rarUSon/y^ ,q 

I C75) l-^^j^, 44 Hands Dn^c Gudph; On^ 

Sg&SSiU%S!ontarioNOBlCO(CA). 

IT oi - Oransc & Chari, 4900-55 
Sc?.^^«^SbSnkm Centn>. Ton^«o, Ontano M5K 
1H6 (CA). 



(81) Designated S^Us: AH^T.^ • ^m. EE. 

BR. BY. CA, CH, CN. CR, Cl^v^ jp 

ES, n. GB. «5v°^°i^S.S: S LU. LV. MA, 
KE. KG, KP, KR;KZ^. IX rq, RU. 

MD. MG, MK, *f Vm TO TtV^ UA. UG. 
SD, SE, SG, SI. SK, SUTJ, m T ^ 
US. UZ. VN, YU. ^. ZW. ARD^ ^j^^ (aM. 
LS. MW, SD. SL. SZj^T^ H?; patent (AT. 

S.^ ?i' si), BJ O'. CG. CI. CM. 



^^^W/h international fimit for amending the 

amendments, ' 



fuTdcic add constmcts. IMither. annoooics h>«- 



FOR THE PURPOSES 
Codes used to identify Smtcs party to the on the 



OF INFORMATION ONLY 

pages of pamphlets publishing intemaaonal applications under the PCT. 



AM 

AT 

AU 

AZ 

RA 

BB 

BE 

BF 

BG 

BJ 

BR 

BY 

CA 

CP 

cx; 

CH 

a 

CM 
CN 

cu 
cz 

DE 
DK 
SB 



Anstrift 

Aostnltt 

Aicibt^jsn 

BosnUindHcncgovttM 
BaibadM 



BofkiniFaso 
Bulgaria 



Bnatfl 
Bdaim 



Oeatn) African Rcpabric 

Congo 

Switzcdand 

COte d'Tvoire 

Csmctooa 

Chtaa 

Cab& 

GeodiRepobHc 
Qenoany 



Bstonia 



F1 

FR 

GA 

GB 

G£ 

GH 

GN 

GR 

HU 

IE 

IL 

IS 

rr 
jp 

KE 
KG 
KP 

KR 

KZ 

LC 

U 

LK 

LR 



Spain 
Finbnd 
Rapoe 
G^ 

Cooipa 



Greece 

Ireland 

fsnd 

Iceland 

Japan 

Kenya 

Kytsyastan 

Democnsic People*! 

RepiAncciriColea 



L5 
LT 

LU 



Sri 
Uberia 



LV 


Latvia 


MC 


Monaco 


MB 


RcpdbOcofMoUm 


MG 


Madi^aacar 


MK 




Republic of Macedonia 


ML 


Mati 


MN 


MoneoCa 


MR 


Manritama 


MW 


MaSawi 


MX 


Motioo 


NE 


Niger 


HL 


NethcrUnds 


NO 


Norway 


NZ 


New Zealand 


PL 


Poland 


FT 


Poctngal 


RO 


Romania 


RU 


Russian Federation 


SD 


Sudan 


SE 


Swedco 


SG 


SiosV"^ - 



SI 


Slovenia 


SK 


Slovidda 


SN 


Soicsal 


sz 


Swaziland 


TD 


Cbad 


TO 


T090 


TJ 


T^kkislan 


TM 


Ttekmcnstaa 


TR 


1^nlocy 


rr 


"ninidadandToboso 


UA 


Ukraine 


VG 


Uganda 


vs 


United Slatea of America 


uz 


Uzbekistan 


VN 


Viet Nam 


YU 


Yugoslavia 


ZW 


Zimbabwe 



wo 00/64247 



PCT/CAOO/00430 



TRANSGENIC ANIMALS EXPRESSING 
SALIVARY PROTEINS 



FIELD OF THE INVENTION 

The present invention relates to transgenic animals and. more specifically, to anmials 

geneticaUy modified to express a desired protein. 

BACKGROUND OF THE INVENTION 

Phosphorus is an essential element for the gro>v1h of all organisms. L". livestock 
production, phosphonis deficiency has been described as the most prevalent mineral 
deficiency throughout the world and feed must often be supplemented with inorgamc 
phosphorus in order to obtain desired giowth performance of monogastric animals (e.g. pigs, 
poultry etc.). 

Phytic acid, or phytate. (myoinositol U.3.4.5.6-hexakis dihydrogen phosphate) is a 
major storage form of phosphorus in cereals and legumes, representing 18«/o to iS% of the 
total phosphorus content (Reddy et al 1982). The enzyme phytase (jnyo-mo^vtoX 
hexakisphosphate phosphohydrolase) belongs to the groi^ of phosphoric monoester 
hydrolases: it catalyzes the hydrolysis of phytate (myo-inositol hexakis phosphate) to 
inorganic monophosphate and lower phosphoric esters of myoinositol or. in some cases, free 
myoinositol. Phytases are classified either as 3-phytases or 6.phytases based on the first 
phosphate group attacked by the enzyme. 3-phytase is typical for microorganisms and 6- 

phytase for plants (Cosgrove, 1980). 

Phytase is either absent or present at a very low levels in monogastric animals (Bitar 
andReinholdl972;Iqbal .fail 1994). Consequently, dietary phytate is not digested or 
absorbed fiom the small intestine and instead is concentrated in fecal material, thereby 
contributing to phosphorus poUution in areas of intensive livestock production. Runoff fiom 
animalfarmsleadstocontaminationofriversandstreams. Such runoff has resulted in rapid 

drops in the oxygen concentration in rivers and streams due to excessive algal growth m 
water, which, intum. has ledtoanincreaseinthemortaUtyrateof fish and existing flora and 

) fiuouuThisisbecomingaglobalproblemaspigandpoultryproductionisincreas^ 

1999;Mamn2000). Furtheimore. phytic add is viev.ed as an anti-nutritional factor because it 
interacts with essential dietary mincnds and proteins limiting the nutritional values of cereals 
and legumes in man and animals (Harland and Morris 1995). 
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For the above reasons, various attempts have been made to enable animals to utiUze 
available phytate in feed. Such attempts have included production of low phytate plants 
(Abelsoa 1999), addition of phytase to the animal feed (Simons et al. 1990) (Stahl et al. 
1999) or transformation of the fodder plants to produce the required phytase (Pen et al. 1993. 
5 Venvoerd et al 1995). A combination of these options, the feeding of phytase to poultry 
receivinglowphytatecomhas also been tested (Huff 1998). However, these solutions 
increase the cost of animal production. Also because phytase is an enzyme, it is susceptible 
to inactivation by heat and moisture and is generally unstable at the high temperatures used 

for feed pelleting. _ 
10 -me primary phytase used for supplementing animal feeds is fiom AsperisUius 

however, phyta^areproducedbyalargenumberofplantsandmicroorganisms(Wodzmsla 

andUUah 1996) (Dvorakova 1998). A phytase produced by ficftencAia co/ihasbeen 
reported to exhibit flxe hi^est activity of those reported (Wodzinski and Ullah 1 996). Tins 
phytase from E. coH was initially cloned as an acid phosphatase gene that was designated 
15 ^P^(Dassaera/.1990).Greineretal.(1991;1993)purifiedphytasefiom£.«7/iand 

reported that some of the kinetic properties of the acid phosphatase activity of the native 
phytase of E. coli were similar to those of the APPA<^coA^ acid phosphatase. However, 
the authors did not clone the phytase gene to prove that it was identical to APPA gene. We 
have subsequently cloned, overexpressed and characterized APPA gene, and shown that the 
20 E co/,gene^P^codesforabifimctionalenzymeexhn,itingbothphytaseandacid 

phosphatase activities (Golovan et al 2000). Phytases exhibit phosphatase activity, however 
the relative activities differ widely among enzymes (Wodzinski andUUah 1996). 

Therefore, tiiere is a need for an improved method of aUowing access by ammals to 
phytase so as to enable efficient phytate metabolism and. thereby reducing phosphate 

25 polludcMi. . 

Ihthe fieldofproteiiiproductionusingiccombinant methods, one of the associated 

problemsrelatestothelackofrequiredglycosylation. Therefore. a mefliod of producmg 
sudi gfycoproteins is also needed. 

30 SUMMARY OF THE INVENTION 

In one embodiment, the invention provides a ttansgemc non^uman ammal ttiat 
carries in the genome of its somatic and/or gemi cells a nucleic acid sequence including a 
heterologous transgeneonurtruct. the construct including a to^ encoding aprotem. the 
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transgene being operably linked to a first regulatory sequence for salivary gland specific 

expression of the protein. 

In another embodiment, the invention provides a transgenic non-human animal that 
carries in the genome of its somatic and/or germ cells a nucleic acid sequence including a 
5 heterologous transgene construct, the construct including a trangene encoding phytase or a 
homologue thereof. 

In yet another embodiment, the invention provides a method of expressing a protem. 

the method comprising the steps of: 

a) introducing a transgene constmct into a non-human animal embryo such that a non- 
10 human transgenic animal that develops from the embryo has a genome that comprises the 
transgene construct, wherein the transgene construct comprises: 

i) a transgene encoding the protein, and 

ii) at least one regulatory sequence for gastrointestinal tract specific expression 
of the protein, 

15 b) transferring the embryo to a foster female; and, 

c) developing the embryo into the transgenic animal 
wherein the transgene is produced in the gastrointestinal tract of the animal. 

fa a further embodiment, the invention provides a transgenic animal adapted for 

expressing a protein according to the above method. The invention also provides for the 

20 progeny of such animal. 

In another embodiment, the invention provides a process for producing a protein 

comprising the steps of: 

a) obtaining saUva containing the protem fiom a non-human transgenic animal, the 
animal containing within its genome a transgene constmct, wherein the tra^^ 

25 comprises: 

i) atnmsgne cncodiDg tiie protein, and 

ii) at least one regulatory sequence for saUvary gland specific expression of 
theprotdn,and 

eachacting the protein fiom flie saliva. 

In a further embodiment, the invention provides a method for expressing a phytase or 
a homologue thereof in a non-human animal, the method comprising: 

a) constructing anucleic add sequence including a transgene construct comprismg: 
i) atransgene encoding flic phytase orahomologue thereof and 
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ii) at least one regulatory sequence for gastrointestinal tract specific expression 
of the protein, and 

b) transfecting the animal with the nucleic acid sequence; 
whereby the animal carries vdthin the genomeofitssomaticand/or germ cells the transgene 

. construct and wherein the animal expresses the phytase or a homologue thereof m its 
gastrointestinal tract 

In another embodiment the invenUon provides a nocleio »dd molecule comprtsmg a 
nueleic acid se,ue„oe including a gene encoding a protein, the gene being operably linked U, 
« least one regulatory sequence for gastrointestinal tract specific expression of *e pr»U=m. 

In another embodiment the invention provides an antibody specific to the protem 
expressedby the above nucWc »=id sequence and a test kit for immunologically det^tmg 
such pn.t«n. The invendon also provides for hybridomas secreting such antibod.es. 

in another embodim«« the invcntioo provides cells that are transfect«l with the above 

nucleic acid sequoice. 

^another embodiment, theinventionprovidesamethod for producingaprotem 

molecule compxisingaglycosylaledpmteinsccretedin the saliva that ^^^^ 

physiological activity. Oneexampleof such an activity is phytase. 

BBIEF DESCRIPTION OF THE DRAWINGS 

•n^ese and other features of the preferred embodiments of the invention wiU become 
more apparent in the following detailed description in which reference is ^ 

appended drawings wherein: 

Figure 1 is. schematic diaBramrepr^entingamethodfeproducinglhe gene 

constntctofthe present i.<.entio.contai.mwtoind»a^lep.«Une^chp.otm 

pn^oter^erfumccr. M«especiflcaUy.F.g»r.lfa.»*«n««fc«»e»-i»^'^'=^ 
in.heconsT«i«.of.het.3nsgenesRl5.APPA4tatron.ndR15/APPAus«lfcr4e 

generation of transgenic mice. 

Figure 2 is a schematic diagram t^resenting a method for producing the gene 
construct ofthe present inventioncontainingtheSV40 promoter. More specifically, Ft^ 2 
30 isaschematicdiagramiUustratingthestepsinconstructionoftheplasmidco^^^^ 
transgcneSVAO/APPA-Hntron flu. was introducedbytransfection into m^^^ 

Figure 3 is a schematic diagram representing a method for producing the gene 
construct of tf»e present invention containing flie constitutive parotid secretory protem (PSP) 
■^promoter/enhancer. More specifically. Figure 3 is a schematic diagram illustrating the step^ 
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in construction of the transgenes Laraa2/APPA that codes for the native AppA phytase and 
the Lama2/PSP/APPA that codes for the AppA phytase with the PSP signal peptide sequence. 

Figure 4 is a schematic diagram of the Laraal-APPA plasmid containing the APPA 
traosgene. 

Figure 5 illustrates the nucleic acid sequence of the Lama2/APPA plasmid containing 
the£. coUAPPA gene (SEQ ID NO: 1). 

Figure 6 illustrates the PGR results for transformed mice. More specifically, figure 6 
is a picture of an agarose gel illustrating APPA PGR products fi-om genomic tail DNA of 
third generation of&pring from the transgenic female founder mouse 3-1 generated using the 
Xhol and Notl fragment of the Lama2/APPA construct A second generation phytase gene 
positive male was crossed with each of two phytase positive transgenic females 9f and 1 If 
(Table 3). From Utter I8m x 9f offepring 3, 4, 5 & 6 are PGR positive and from litter 18m x 
llf ofispring 2 and 3 are PGR positive. Std is the oligonucleotide standard and the numbers 
on the left arc the bp sizes of the standard. Lane C is a negative control reaction mixture that 
lacks a DNA template and appA is a positive control contaming an ampfified segment of the 
phytase gene. The primers used were APPA-UP2 and APP A-KPN. 

Figure 7 iUustrates the PGR results for transformed founder pigs. More specificaUy, 
Figure 7 is a picture of an agarose gel illustrating phytase gene PGR products and p-globui 
PGR products from genomic taU DNA of five founder piglets from Utter 167. Std is a 1 kb 
ladder. Lane 2 using the phytase primer set is positive for the phytase gene, and all of the 
samples were positive for the p-globin gene. Lane C is a negative control not containing 
template DNA. The phytase transgene primer set included APP A.UP2 and APPA-KPN gave 
an expected fragment size of 750 bp. The primer set for the P-globm gene included PIG-BGF 
and PIG-BRG pves an escpected fragment size of 207 bp. 

Figure 8 fllustratcs die PGR results for transgene rearrangement tests. More 
specifically. Figure 8 is apictuic of an agarose gel showing the PGR products of four 
separate primer sets used to amplify different segments of the transgene introduced into pig 
167-02. The Std contained a kilobase DNA ladder. The primers used included lane 1, APPA- 
UP2 and APPA-KPN (750 bp); lane 2, APPA -MATURE and APPA-KPN (1235 bp); lane 3 
APPA MATURE and APPA-D0WN2 (608 bp); lane 4, PIG-BGF and PIG-BGR (207 bp), 
lane 5, a negative control without DNA template added; lane 6, the eqfpA gene & primers 
APPA-UP2 and APPA-KPN. The numbers on the left indicate ttie sizes of the bands in the 
standard. No PGR products were detected m the absence of either DNA template orpiimere. 
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Figure 9 illustrates weight and saUvary phytase activity of the transgenic boar 167-02 
and average weight of the pen-mates at intervals during growth. Symbols: Weight of 1 67-02, 

Average weight ± SD of four penmates, A; phytase activity of 167-02, ■; Phytase 
specific activity, □. Arrows indicate sampling for fecal phosphorus concentration. 

Figure 1 0 illustrates weight and salivary phytase activity of the transgenic boar 282- 
02 and average weight of the pen-mates at intervals during growth. Symbols: Weight of 282- 
02, Average weight ± SD of five penmates, A; phytase activity of 282-02, ■; Phytase 
specific activity, □. Arrows indicate sampling for fecal phosphorus concentration. 

Figure 1 1 illustrates weight and salivary phytase activity of die tiansgenic boar 282- 
04 and average weight of the pen-mates at intervals during growth. Symbols: Weight of 282- 
04, •; Average weight ± SD of five penmates. A; phytase activity of 282-04, ■; Phytase 
specific activity. □. Arrows indicate sampling for fecal phosphoms concentration. 

Figure 12 illustrates wei^t and saUvaiy phytase activity of the transgenic boar 405- 
02 and average wei^t of the pen-mates at intervals during growth. Symbols: Weight of 405- 
15 02, Average weight ± SD of four penmates. A; phytase activity of 405-02. ■; Phytase 
spedfic activity. □. Arrows indicate sampling for fecal phosphorus concentration. 

Figure 13 fflustrates wei^t and saUvary phytase activity of the transgenic boar 421- 
06 and average weight of the pen-mates at intervals during growth. Symbols: Weight of 421- 
06. •; Average weight ± SD of four penmates. A; phytase activity of 421-06, ■; Phytase 
20 specific activity. □. Arrows indicate sampling for fecal phosphorus concentiatibn- 

Figure 14 illustrates the PGR results of first generation pigs. More specifically, 
Figurt; 14 is a picture of an agarose gel showing the PGR analysis of eight Uter 154 piglets. 
The phytase transgenic boar 167-02 was used to breed a non-transgraiic fenude. Std, 100 bp 
ladder, numbers on left are the sizes of die fragments in each band in bp; Une 167-02, DNA 
25 fiomboar 167-02 1. DNA fixMn 167-02; lane Q is a lane without added DNA; lanes 1-S. are 
amplified DNA inserts from each of tiie oflfepring piglets of die Utter. Phytase primers were 
Lama-UP and APPA-D0WN4. p-globin primers were PIG-BGF and PIG-BGR. 

Figure 15 illustrates a sodium dodecylsulfate gel stained with silver demonstrating die 
sizes of the E. coli produced APPA phytase and die APPA phytase produced by tije pig and a 
30 demonstration that the pig phytase is glycosylated. More specifically. Figure 15 is a picture 
of a sodium dodecylsulfete polyaaylamide gel electrophoresis (SDS-PAGE) profile of tiie 
purified AppA phytase produced in E. coli and the purified pig salivary phytase stained 
diiectty witii silver (A) and a transfer from a similar SDS-PAGE gel transferred to 



RECTIFIED SHEET (RULE 91) 



wo 00/64247 



7 



PCT/CAOO/00430 



nitrocellulose and stained for glyopioteins (B). Creatinase is not glycosylated while 
transferring is glycosylated. The numbers on the left are the masses in of the molecular mass 
standards (Std) expressed in kDa. 

Figure I SB is a picture of Western blot of the untreated pig App A phytase and the 
5 same phytase after treatment with a combination of three deglycosylating enzymes. Lane 1, 
Purified AppA phytase produced in E. coli (untreated); lane 2, purified pig phytase 
(untreated): lane 3, purified pig phytase treated with the combination of deglycosylating 
enzymes including N-glycosidase F. 0-glycosidase and neuraminidase. 

Figure 16 iUustrates a Western blot of the pig phytase and the E. coli produced APPA 
10 phytase usmg monoclonal antibodies directed to the APPA phytase documenting that they 
have homologous epitopes. More specifically. Figure 6 is a Western blot of the AppA 
phytase from pig saUva after various purification steps and of purified phytase produced in E. 
coli. A monoclonal antibody prepared against the E. coli phytase was used as the primary 
antibody for detection. Lane 1, saliva from non-tcansgenic pig 164-04; lane 2. saUva from 
15 transgenic pig 167-02; Lane 3. saUva fraction not bound to DEAE-Sepharose; lane 4. 
saUvary phytase bound to DEAE-Sepharose and released witfi an NaCl gradient; lane 5. 
salivary phytase fiirther purified by Chromatofocusing with a pH gradient of 4 to 7; lane 6. 
phytase purified from E. coli. The numbers on the left are the masses of molecular mass 
standards (not shown) expressed in kDa. 
20 Figure 17 iUustrates an SDS-Page of the E. coli APPA phytase. saUva samples from 

phytase negative and positive pigs and mice and a corresponding Western blot documenting 
that phytases from all three sources have homologous antigenic epitopes, but the animal 
phytases are larger than that produced in E. coli. More specifically. Figure 6 is a SDS-PAGE 
profile of the purified £1 coli produced AppA phytase and the AppA phytases produced by 
25 pigs and mice stained with silver (A) and a Western blot of an identical set of protein samples 
(B). A polyclonal antibody prepared against the E coli phytase was used as the primary 
antibody for detection. Lane 1, Purified AppA phytase produced in E. coli; lane 2, SaKva 
from anon-transgenic pig 164-01; lane 3. Saliva from a AppA producing transgenic pig 167- 
02; lane 4. Purified phytase from pig 167-02; lane 5. Saliva from a non-transgenic mouse; 
30 lane 6. Saliva from a transgenic mouse containing R15/APPA transgene induced witti 

isopiotcienol; lane 7. Saliva from a transgenic mouse containing the Lama/APP A transgene; 
Std, Molecular mass markers. The numbers on tfie left are the masses of molecular mass 
stan4ards (not shown) expressed in kDa. 
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Figure 18 illustrates the nucleic acid sequence of the known segment of the 
R15/APPA + intron plasmid including the vector sequences of pBLCAT3 (SEQ ID N0:2). 

Figure 19 illustrates the nucleic acid sequence of the known segment of the 
R15/APFA + intron transgene construct used for the generation of transgenic mice (SEQ ID 
5 N0:3). 

Figure 20 illustrates the nucleic acid sequence of the known segment of the 
R15/APPA plasmid including the vector sequences of pBLCAT3 (SEQ ID N0:4). 

Figure 21 illustrates the nucleic acid sequence of the known segment of the 
R15/APPA transgene construct used for the generation of transgenic mice (SEQ ID N0:5). 
10 " '"Figure22mustratesthenucleicacidsequenceoftheSV40/APPA + intronpl^^^ 

(SEQIDN0:6). 

Figure 23 iUustrates the nucleic acid sequence of the Lama2/APPA transgene 
construct used for the generation of transgenic mice and transgenic pigs (SEQ ID NO: 7). 

15 DESCRIPTION OF THE PREFERRED EMBODIMENTS 

In the following description, a number of recombinant DNA technology temis are 
used. ThefoUowingdefinitionshavebeenprovidfidinordertocnableaclearer 

understanding of the specification and appended chiiras: 

"Promoter" - aDNA sequence generally described as the 5' region of a gene and 
20 locatedproximaltothestartcodon. The transcription ofan adjacent gene is initiated at the 
promoterregion. If a promoter is an inducible promoter then the rate of transcription 
increases in response to an inducing agent. A constitutive promoter is one that ini^ 

transcription ofan adjacent gene without additional regutetion. 

-Operably linked" - a nucleic add sequence is "operably linked" when placed into a 
25 functional relationship with another nucleic add sequence. For instance, a promoter or 

enhancer is "operably Unked" to a coding sequence if the promoter causes the transcription of 
the sequence. Generally, operably linked means that the linked nucldc acid sequences are 
contiguous and. where it is necessary to join two protein coding regions, contiguous and m 
one reading firame. 

30 "Phytase- - any protein that liberates phosphate ftom myo-inositolhexakis-phosphate 

or other inositol phosphates. Its catalytic capabiUty may be limited to phytic acid or one of 
its salts, or itmayshowlessspecificity and hydrolyzc a variety of phosphorylated 

compounds. 
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"Gene" - a DNA sequence that contains a tonpUte for an RN A polymerase ana 
contains information needed for expressing a polypeptide or protein. 

"Polynucleotide Molecule" - a polydeoxyribonucleic (DNA) acid molecule or a 
polyribonudeic acid (RNA) molecule. 

"Expression" - the process by which a polypeptide is produced from a structural gene. 

"Cloning vehicle" - is a plasmid or phage DNA or other DNA sequence which is 
cjq)able of carrying genetic information into a host cell. A cloning vdiicle is oftrax 
characterized by one or more cndonuclease recognition sites at which such DNA sequences 
may be cut in a determinable fashion without loss of an essential biological function of the 
vehicle. A cloning vehicle is a DNA sequence into which a desired DNA may be spliced in 
order to bring about its cloning into the host celL 

"Vector" - is a teem also used to refer to a cloning vehicle. 

♦Tlasnrid" - is a cloning vehicle generally comprising a circular DNA molecule that is 
maintained and rq)licates autonomously in at least one host cell. 

"Expression vducle" - a vdiicle or vector similar to a cloning vehicle but which 
supports expression of a gene that has been cloned into it, after transformation of a host The 
cloned gene is usually placed under the control of (Le. is operably linked to) certain control 
sequences such as promoter sequences. 

"Host" - a cell that is utilized as the recipient and carrier of recombinant maleriaL 

"Homologous" - refers to a nucleic acid molecule that originates from the same genus 

or ^edes as tiie host 

"Heterologous" - refers to a nucleic add molecule fljat origjnates from a diffecot 

genus or species dun diat of the host. 

"Glycoprotein" - fcfere to a peptide molecule that has undCTgone glycosylation. 

"Glycosyialion" - refers to Ac addition of carbohydrate groiqps to a amino add 
residues of a peptide molecule. 

In recent years, transgenic animals have been developed for many purposes CPinkrat 
et al. 1990) (Wall ei al. 1997). One premise, therefore, for the present invention is that by 
providing a transgenic animal capable of expressing phytase, the problems discussed above 
would be obviated. The options for hetoologous phytase expression in animals include (i) 
salivary gland secretion of a phytase, (ii) pancreatic secretion of the enzyme into the small 
intestine along with the digestive enzymes, or (iu) secretion from the intestinal epitheUal cells 
much like that of indigenous alkaline phosphatase and glycosidases (Low, 1989). The £. coii 
phytase would appear to be best suited for hydrolytic activity in the monogastric stomach 
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because the enzyme has a pH optimum in the range of 2.5 to 4.5 and it is resistant to pq)an, the 
predominant protease active in the stomach. The phytase has a periplasmic location in £. coli 
and has an N-tenninal signal peptide sequcaice (Golovan ct al., 1999) that seoned optimally 
adapted for secretion from the parotid gland. Phytase could be expressed in either the pancreas 
for secretion into the small intestine or it could be expressed in tiie intestinal epitiielial tissue 
and secreted into the intestinal milieu. However, if these choices of expression locations 
were chosen, it would be necessary to select an enzyme active at the more neutral pH of the 
small intestine and one which was more resistant to pancreatic enzymes including trypsin, 
chymotrypsin and elastase. 

Factors of importance in terms of the expressed enzyme when selecting a phytase for 
expression in the gastrointestinal tract include a pH that is optimum for activity, high 
catalytic activity, broad substrate specificity, and protease resistance. If any of tiiese 
piopartics, or indeed otiiers, is not accq)table, there are now sophisticated molecular methods 
for modifying tiie properties of an enzyme. These include site directed mutagenesis, random 
mutagenesis and various modifications of DNA shuffling OSatayama, 1998; Cramm aL, 
1998). 

Synthesis of phytase in the salivary gland and secretion in the saliva would, thwefore, 
provide for early contact of the enzyme with phytic acid present in the feed and provide 
suffici^it time for hydrolysis. 

The salivary gland system of the pig consists of three pairs of glands, the parotid gjand, 
which secretes through a duct on each chedc. and mandibular and submajuUary glands that have 
joint ducts that searete beneath the firont on the tongue. Saliva secreted in the pig via ttiese ducts 
is discontinuous and is produced during consumption of solid foods, and can equal the weigjit of 
food consumed when water is limited during feed consumption (Corring, 1980; Aildiipovets, 
1956). For example, the quantity of saliva produced by a 45 kg pig can vary from near zero 
when the pig receives a mainly liquid di^ to 500 g when a dry diet is consumed wiftout access 
to water. The salivary glands of the pig secrete amylase (Roaiikov and Galiraov, 1990) and a 
variety of other salivary proteins and mucopolysaccharides. 

To our knowledge no porcine genes coding for salivary proteins have been cloned. 
However, genes coding for major proteins secreted by the rat and mouse have been cloned and 
characterized A multigene family encoding a group of unique proteins hi^ in proline, the so- 
called proline-rich proteins (PRPs) are produced when dthCT nuce or rats consume tannins or are 
injected with isoproterenol 
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It would be advantageous to develop an animal fliat is transformed to express phytase, 
preferably in the salivary gland. In such case, the phytate naturally occurring in the animal 
feed can be utilized by the animal without any additives being used. This will decrease the 
cost of animal production, and furthermore, will avoid polluting the environment with 
phosphorus. Therefore, the present invention aims to overcome the deficiencies of the prior 
art relating to increasing phytate utilization and, particularly, to provide transgenic animals 
which express phytase. 

In the production of heterologous proteins by means of recombinant methods, several 
hurdles have been faced. One such hurdle that is often faced is the lack of required post- 
translational modification of the expressed protein thereby resulting in a protein that is 
stnicturally and/or functionally, different from the desired molecule. Glycosylation is one 
such post- translational modification that is desired. However, such modification is generally 
found to occur in more complex mammalian systems. Therefore in one embodiment of the 
present invention Aere is provided a method of producing recombinant glycoproteins. 

JiL one embodiment, the presort invention provides an anhnal capable of inducible or 
constitutive salivary «cpression of a heterologous protein. To illustrate this, the mouse was 
chosen as the animal modd and the gene constructs used for transformation were created 
using ttie rat proline-rich protein (PRP) promotei/enhancear (inducible promoter) and the 
mouse parotid secretory protein (PSP) promoter/caihancer (constitutive promoter), hi this 

illustration, phytase was used fox expression in saliva. 

After finding that an inducible phytase could be expressed in the parotid 

of mice the expression of the phytase transgene undCT flie control of the constitutive PSP 

promoter was then tested. Two mice transgenic for the PSP constract were produced under 

contract at tiie Uhivei^ty of A la b a m a. 

FoUowing the testing of fl»c nuce described above, transgenic pigs were developed by 

introduction into tiie genome a phytase transgene consisting of a constitutive promoter 

driving the synthesis of a hi^y active phytase. The pigs so generated ware found to exai^ 

less phosphorus in their feces than non-transgenic pigs. 

Expression in the Salivary Glands 

Saliva is a clear colorless fluid secreted by major salivary glands (parotid, 
submandibular, sublingual and minor saUvary) that lubricates and cleans the oral struchire, as 
well as initiates the process of digestion. The parotid glands are two of six major glands 
associated vnth ttie production of saliva. The parotid gland is composed mdnly of two cell 
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types: acinar and interglobular duct cells. The acinar cells, which represent 75 to 85%of the 
tissue, are the sites of secretory protein synthesis (Frandson and Spurgeon 1992). Two very 
abundant proteins are produced by these cells: a-amylase (AMY-1) (2% of polyA RNA) 
(Madsen and Hjorth 1985), and parotid secretory protein (PSP) (10% of polyA RNA) (Shaw 

5 and Schibler 1 986). Several constructs are now available which allow tissue-specific 
expression of a transgene in the salivary glands of mice. 

The salivary secretion in pigs has not received the attention given to that of mice and 
humans. It was suggested that salivary secretion is discontinuous (less secreted between 
periods of meal consumption). Up to 500 g of saUva may be secreted by a 45 kg pig upon 

10 consumption of 500 g of dry feed (Coning 1980). Wide variations were detected in both the 
flow rate and electrolytes in saHva between animals and even between samples taken from 
the same animal on separate days (Tryon and Bibby 1966). Very Uttle is known about the 
composition of pig's saliva or saUvary enzymes. Salivary amylase was detected, although the 
quantity was 250 000 times less than that of pancreatic amylase, and 100 times less than in 

15 human saUva (Low 1989). There are no constructs known which would allow salivary gland- 
spediic expression of transgene in pigs. 

n APPA Gene Under Control Of An Indn a'ble Promoter 

20 1) Construction of R15/AFPA constructs (Inducible Promoter) 

In this process, a plasmid is constructed by linking a promoter/enhancer for a sattva 
protem with the APPA gene, which codes for the bifimctional phytase, add phosphatase. The 
APPA gene used in this construction was cloned from K coli ATCC 33965 into pBR322. 
This is described above (Golovan ct aL. 2000). 

25 Proteins, unusually hi^ in proline, the so-called ptolinMich proteins (PRPs), 

comprise about 70% of the total proteins in human saliva (Beonick 1982). Unlike the 
constitutive expression of the PRPs in humans, the salivary glands of mice, rats and hamster 
normaUy eitiier do not express PRPs or express them in low levels, hi tire rat and mouse. 
PRP gene expression can be dramatically induced by diets high in tannins or by injection 
30 witii tiie p-agpnist isoproterenol (Carlson 1993). After 6 to 10 days of daily isoproterenol 
injection the PRPs comprised about 70% of tiie total soluble protein in parotid gland extracts. 
PRP cDNA and PRP genes have been cloned and characterized from rats (Clements et al. 



RECTIHED SHEET (RULE 91) 



wo 00/64247 



13 



PCT/CAOO/00430 



1985), mice (Ann and Carlson 1985), hamsters (Mehansho et al 1987). and humans 
and Maedal986). 

Transgenic mice were used to locate the cis-acting DNA elements that are essential 
for sabvary-specific and inducible expression of the rat proline-rich protein gene, R15. It was 
found that a parotid control region (-6 to -1.7 kb) upstream of the R15 promoter is capable of 
directing parotid-specific and isoproterenol-inducible expression of a heterologous promoter 
construct (Tu et al 1993). The distal -10 to --6 kb region was shown to function as an 
enhancer, which can increase levels of expression more than 30-fold. The -6 to -1.7 kb 
region also seems to fimction as a locus control region (LCF.), because it conferred copy 
number-dependent and chromosomal position-iadq)endent expression of a rqporter gene in 
15 out of 15 independent transgenic mice (Tu, Lazowski, Ehlenfeldt, Wu, Lin, Kousvelari, 
and Ann 1993). 

We obtained the R15-PRP promoter from Dr. DJC- Ann as a plasmid -lOiJiJ/CAT, 
which placed the chloramphenicol acetyitransferase gene (CAT) under control of the 
inducible R15-PRP promoter. We decided to use the plasmid as a basis for transgene 
construction (Figure 1). Due to the absence of complete sequCTce infomiation about the 
R15-PRP piDmotCT (only 2 kbp out of 10 kbp was sequenced) we removed the R15-PRP 
promote by Xho I digestion (Figure 1, step 1). Re-ligated plasmid was used as a template 
for PCai with CAT-ATG and CAT-TAA synthetic primers. The 4 J kbp CATpcr fiagmoit 
had the initiation site of the CAT gene substituted with the optimal eukaryotic initiation 
sequence (Kozak 1987). The fragment was purified by agarose gel electrophoresis, re-Ugated 
to itself and used to transform E. coli (Figure I, step 2). The CATpcr pl^^smid was digested 
with Nco I and fiUed-in usmg T4 DNA polymerase to gen^e a blunt «iA After that, the 
CATpcR fragment was digested with Eco47III and purified by agarose gel electrophoresis 
figure 1, step 3). Three rare codons in the ilP/^l gene were modified during the sub-cloning 
steps leading to the construction of the transgene. Specifically, the Ala3 coding sequence was 
changed from GCG to GCC, the Pr0428 sequence was changed from CCG to CCC, and the 
Ala429 sequence was changed from GCG to GCT. This modification was made in order to 
increase the possibility of transcription of the gene in eukaryotic cells. The APPA gene was 
amplified by PCR using the previously cloned AFPA gene fcom tiie pBKi22/ APPA plasmid 
with the synthetic primers APPA-DRA and APPA-SMA, The 13 kbp APPApcr ftagment 
generated by PCR was digested with Dra I and Sma I and gel-purified (Figure 1, step 4), 
APPApcRand CATpcr fragments were blunt end ligated to produce CAT/APPA+intron 
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vector (Figure I, step 5), which was introduced into a DH5a strain of jB. colL The insect 
orientation was checked by restriction digest with Sal I and EcoR I. The transgene in 
CAT/APPA+intron was checked by sequencing both strands. To remove the S V40 small t 
intron the 23 kbp APPAfmtTOX\/polyA fragment was excised from a plasmid by Xho I and 
5 EcoR I digestion (Figure 1 , step 6a), gel purified and digested by Dra I (Figure 1 , step 6b). 
The I -5 kbp (APPA) and 0.2 kbp (polyA) fragments were gel-purified and linked together in 
three way ligation with CATpcr digested with Xho I and EcoR I (Figure 1, step 6c). The 
resulting plasmids CAT/APPA and CAT/APPA+intron were digested with Xho I, gel- 
purified and re-ligated with R15-PPJ? promoter digested with Xho I figure 1, step 7). 

1 0 Because of the low efficiency of ligation the whole ligation mixture was used to transform 
ExolU total plasmid DNA was prepared and run on the agarose gel. Plasmids which were 
larger than the original CAT/APPA (5.6 kbp) were eluted and re-transformed in E.colu 
Plasmids with the R15-PRP insert (15 kbp) were identified by electrophoresing DNA fiTom a 
single colony on an agarose geL The correct orimtation was identified by PCR with R15- 

15 UPland APPA-D0WN2 synthetic primers. The plasmids R15/APPA and R15/APPA+intron 
were both digested with Hind m and Kj)n I; transgenes were gel-purified and fiirther purified 
using a Qiagen column (Figure U step 8). 

Figure 1 8 illustrates the nucleic acid sequence for the plasmid containing the known 
segmomt of the R15/APPA + intron sequence including the vector sequences of pBLCATS. 

20 The sequence of ttiis plasmid is designated as SEQ ID N0:2. 

Figure 19 illustrates the nucleic acid sequence for the transgene construct containing 
the known segment of the R15/APPA + intron sequence used for the generation of transgenic 
mice. The sequence of this transgene is designated as SEQ ED NO:3. 

Figure 20 illustrates the nucleic acid sequence for the plasmid containing the known 

25 s^ment of the Rl 5/APP A sequence including the vector sequences of pBLCAT3. The 
sequence for diis plasmid is designated as SEQ ID N0:4. 

The pBLCAT3 sequence indicated above is presMit in the CAT/APPA of Figure 1 and 
in the CAT/APPA+intron of Figure 2. This sequence was part of the ori^nal -10R15/CAT 
and a portion of it was earned through in the construction process. 

30 Figure 2 1 illustrates the nucleic acid sequmce for the transgene constmct containing 

the known segment of the Rl 5/APP A sequence used for the graeration of transgmic mice. 
The sequence of this transgene is designated as SEQ ID NO:S. 
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2) Expression of SV40/APPA+intron in Cell Culture 

To produce an SV40/APPA plasmid for expression otAPPA in cell culture, the SV40 
promoter/enhancer was amplified by PGR from the pSV-p-galactosidase plasmid (Promega) 
using the synthetic primers S V-HIND and SV-XHO. The SV40 promoter/enhancer fragment 
5 was digested with Xho I and Hind m, gel purified, and ligated into CAT/APPA digested with 
Xho r and Hind m (Figure 2). 

Figure 22 illustrates nucleic acid sequence for the SV40/APPA + intron. The 
sequence for this plasmid is designated as SEQ ID NO:6. 

We obtained a rat parotid acinar cell line PARC 5.8 (Quissell et aL 1998) that we 

10 intended to use for transient expression of the phytase transgene. The purpose was to test the 
elEBciency of different constructs for transgene expression and also to detect any deleterious 
effects of phytase expression before introduction into the animals. We tried transient 
expression of the APPA gene using R15/APPA and R15/APPA+intron constructs but because 
of low transfection efficiency and/or low expression levels, we were unable to detect either 

IS phytase or P-galactosidase that we used as a control for transfection. 

We exchanged the R15-PRP inducible promoter from the R15/APPA construct with 
the SV40 constitutive promoter-CThancer, which enables higih level transient expression in 
different cell cultures. CHO, COS7 and HELA cell lines w^e soxened for transient 
expression of the APPA phytase usmg the S V40 promotei/mhancer- All cell lines were 

20 maintained on DMEM/F12 (Sigma) cell medium wifli 10 % (wt/vol) heat-inactivated fetal 
bovine serum at 37^C in 5% CO2 and 95% air. Cells were grown to 70 % confluence before 
transfection. Two hours before transfection the medium was ^changed with fresh medium. 
Cells were transfonned with 5 \x% of DNA per 60 mm culture plate (1:1 SWAQfAPPA and 
SV40/p-galactosidase) using the DNA-Calcium-Phosphate method of transfection (Gorman 

25 c/ aL 1983). After 6 hours of incubation the medium was removed and cells were subjected to 
glycerol shock for 3 min (Ausbel et al 1992). Cells were washed with phosphate-buffered 
sahne (PBS) and incubated in fresh medium under standard growth conditions. After 48 
horns of incubation cell-free culture fluid was collected, the cells washed two times with PBS 
and lysed with 1ml of 1% (vol/vol) NP-40, ImM disodium EDTA in Hanks balanced salts 

30 (HBSS) for 1 hour at 4^C. The phytase assay was performed in a final volume of 100 fil of 
O.l M sodium acetate/acetic acid buffer (pH 4,5) using sodium phytate (4 mM) as a substrate 
at 37^C. After 6 hours of incubation the reaction was stopped with 67 |il ammonium 
molybdate/ammoniunrranadate/nitric acid nuxture and the concratration of liberated 
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inorganic phosphate determined at 405 niri (Engelen et al. 1994). One unit (U) of mzyme 
activity was the amount of the enzyme releasing 1 (imol inorganic phosphate p^ minute. The 
assay was perfonned in triplicate. As a control for adogenous phytase activity, non- 
transfected cell lines were used. 
5 We did not detect endogenous phytase activity in non-transfected cell lines. Phytase 

activity was detected in all transfected cell lines, with C0S7 cells expressing a total of 0,35 U 
of phytase in cell-jfree culture fluid (4 ml) and 0.0034 U in the cell fraction (1.1 ml) obtained 
from the same plate. The phytase activity produced by C0S7 cells was 7 times higher than 
that of CHO and 35 times more than the HEIA cell line. More than 99% of activity was 
10 located in cell-free culture fluid, which suggests that the expressed enzyme was exported out 
of the cell using the bacterial signal sequence. We were imable to detect expression of 
cytoplasmic P*galactosidase, which we wanted to use as a control for transfection efficiency. 

3) Expression of R15-PRP/AFPA in Transgenic Mice 

15 Transgenic mice were generated using the constructs RIS/APPA and 

R15/APPA+intron by Dr. CA. Pinkert at the NICHD Transgenic Mouse Development 
Facility (NTMDF), University of Alabama at Binningham, Alabama. The procedures 
followed in generating the mice have been standardized by the NTMDF and further 
information concerning this can be obtained at: http://transgenics.bhs.uab.edu/pageLhtnL the 

20 content of whidi is incorporated herein by reference. This procedure involved the 

microinjection technique for transfecting mice with the desired nucleic acid sequence. To 
summarize, the sequences are microinjected into mouse zygotes and the surviving eggs are 
implanted into pseudopregnant recipient mice. The recipient mice then give birth to the 
resulting founder transgenic mice. It will be appreciated that various other methods of 

25 generating transgenic mice may be used in the present invention. 

The R15/APPA transgcne in mice was detected by PGR using the primers CAT-UP 1 
and APPA-DOWN2 that gives rise to a 700 bp fragment using the standard PGR conditions, 
except that the hybridization step was set at 51°C for 40 seconds and the polymerization step 
was at 72^C for one minute. 

30 For the R15/APPA construct 8 PGR positive founder mice were obtained of which 4 

were males and 4 wm females. Three of the founders did not pass the transgene to progeny 
and were probably mosaics. For Rl5/APPA+intron 5 PGR positive founder mice were 
obtained, 3 were males aG{l 2 were females, and one of them was found to be mosaic. At 10 
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to 12 weeks of age PRP production in the PGR positive progeny from differrat lines was 
induced for 10 days by daily intraperitoneal (ip) injection of Img isoproterenol dissolved in 
100 fil sterile saline. To serve as a control several PGR negative progeny were also mduced. 
No significant differences in weight were noticed between PGR positive and PGR negative 
progeny at either the beginning or end of the induction period. Saliva was collected before 
induction and at the end of the 10 day induction period. 

To collect saliva, mice were lightly anesthetized with a ketamine/xyiazine mixture (ip 
injection of 50 mg ketamine and 5 rag xylazine per kg body weigjit diluted in water) and 
saliva flow was induced by injection with pilocarpine/isoproterenol (ip injection of 0.5 mg 
pilocarpine and 2 mg isoproterenol per kg body weight dissolved in saline) (Hu et aL 1992). 
Between 100-250 \il of saliva was collected from each mouse over a 30 min period beginning 
5 min after the pilocarpine/isoproterenol injection. 

The saliva was collected from each mouse by holding it in one hand and withdrawing 
saliva from the comer of the mouth with a 20 fil pipetter. Collected saliva was transferred to 
a cold Eppendorf microcentrifiige tube containing 2 [il of 0.5 M EDTA (pH 8.0) and 4 \il of 
10 mg/ml protease inhibitor Pefabloc (Boehringer Mannheim) dissolved in water. The tubes 
with saliva were kept on ice until assays were conducted Phytase activity in the saliva was 
assayed as described for the S V40/APPA expressed in cell culture. 

Phytase expression was not detected in either un-induccd or in induced PGR negative 
mice. For PGR positive mice, phytase expression was not detected in those that were un- 
induced. However, phytase expression was observed for PGR positive mice that were 
induced. The results of this study arc summarized in Table 1. 

Even thou^ it was possible to distinguish saliva from induced PGR positive from tiiat 
of PGR negative mice in a phytase assay by a characteristic yellow color, saliva from some of 
the negative mice, when assayed, produced cloudiness tiiiat was impossible to remove by 
centrifugation and that a£fected spectrophotometer readings. We did not notice any grader 
(UfTerences in expression, both males and females were found to produce phytase in saliva. 
In tiiree lines (all R15/APPA+intron) no phytase expression or very low level of expression 
(0.03-0.95 U/ml) was detected, in 4 lines the level of expression ranged from 7 to 87 U/ml, 
and two lines (both R15/APPA) produced very high levels of phytase in saliva, 252 and 547 
U/ml 

These experiments demonstrated that phytase can be expressed at a very hi^ level in 
the salivary glands of miceywithout detrimental effects on the animals. We also were able to 
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produce progeny with an inducible salivary phytase from animals expressing the inducible 
phytase thereby documenting inheritance of the trait, and showing that the rqjroductive 
capability of animals was not affected. When the F2 generation of mice were tested for 
salivary phytase the level of phytase production was preserved. 
5 Founders containing the transgene without the intron gave offspring that produced 

significantly higher levels of phytase. The SV40 intron in the R15/APPA+intron construct 
seems to cause a lower level of expression, and in three lines (Alf, A20f and BOm) the level 
of phytase was barely detectable. The level of phytase expression in A2m Une 
(RlS/APPA+intron) was 6,2 times lower than that of the BOm-intron line (R15/APPA). 
1 0 Preliminary experiments showed that when the enzyme was analyzed by PAGE its 

size was increased from 42 kDa to 60 kDa, It is likely modified by glycosylation, but stable 
and active. 

in APPA Gene Under Control Of A Constitutive Promoter 

15 

1) Construction of the Lama2/APPA Transgene (Constitutive Promoter) 

The murine parotid secretory protein (PSP) is the most abundantly expressed protein 
in the parotid gland of mice Q^adsen and Hjorth 1985). Aftcc an hour of pulse labeling, the 
mouse parotid gland incorporates 65 to 85% of '^C-leucine into this single protein (Owerbach 
20 and B^orth 1980). It was estimated that PSP mRNA accumulates up to 50,000 molecules per 
cell and that from 3 to 5 molecules of PSP are produced for every molecule of amylase 
(Madsen and Hjorth 1985). Despite the predominance of the PSP in saliva its function is not 
well diaracterized. 

The single-copy gene coding for PSP has been cloned and characterized. It has two 
25 alleles PSP* (Shaw and Schibler 1986) and PSP^ (Owerbach and I^orth 1980). ThePSP^ 
allele is also expressed in the sublingual gland, but at I/IO of the level found in the parotid 
gland It was shown that 4.6 kbp of 5* flanking sequence of PSP^ is sufficient for salivary 
gland specific expression. The level of sublingual expression ^proached 100% of the PSP 
mRNA level, whereas the parotid expression did not exceed 1% (Mikkelsen et aL 1992), 
30 which demonstrates that regulatory sequences for sublingual and parotid expression are not 
identical. The level of expression was also dependent on the site of integration. The same 
construct was used for expression of the C-tcrminal chain of the human blood coagulation 
factor Vlll, FVlll. A high level of FVIU mRNA was detected in the sublingual gland and a 
low level in the parotid gland. The transgenic lines also secreted the FVlll light chain into 
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saliva at the level of about 10 units per salivation (about 0, 05 nil of saliva) (Mikkelsen et 
al,,1992). Later the same group achieved a high level of parotid-specific expression that was 
similar or even exceeded that of the endogenous gene by using 1 1.4 kbp of 5' flanking 
sequences and 2.5 kbp of 3'flanking sequences (Larsen et ai 1994). The expression also 
5 seems to be position-independent and copy-number-dependent that could indicate the 
presence of a LCR in these sequences. 

Lama 2 is a portion of the PSP gene and comprises an 18 kbp construct that is 
expressed in transgenic mice at up to 56% of the endogenous PSP gene. 

Because a large part of Lama 2 had not been sequenced, the construct was first 
10 disassembled and subcloned into pBluescript KS(+) and aflCT incorporation of the APPA 
gene, the Lama 2 was reassonbled back (Figure 3). We used unique enzymes Rsrll and 
Smal to remove a 3.4 kbp fragment from Lama2, which was subcloned into the multiple 
cloning site (MCS) of pBluescript II KS(+) that was previously digested with B^pnl and 
Smal, using a Kpnl-Rsrll adapter (Figure 3, step 1). 
15 Kpnl* RsrO 

TGGGAGGTCG 
CATGACCCTCCAGCCAG 

That allowed us to preserve the RsrII (CG/GWCCG) site and destroy the Kpnl site 
(GGTAC/O GGTAC/T), which would otherwise interfere with future cloning. The 
20 pKS/Lama construct was digested with Apal and Kpnl and used in a three-way ligation with 
the modified APPA (Figure 3, step 2). We designed two PSP/APPA constructs. One 
construct APPA-signal/APPA (Figure 3, steps 3a-7a) had the original bacterial signal 
sequence from the APPA protein having the following amino acid sequence: 

25 Met-Lys-Ala-nc-Lcu-ne-Pro-Phe-Lcu-Scr-Leu-Leu-nc-Pio-Leu-^ 
Ala 



We also modified a sequence near the ATG codon to resemble the optimal 
manmialian Kozak sequence (GCC GCC A/GCC ATG G) (Kozak 1987), but we did not 
30 mutagenize the +4 position because it would change Lys to Glu in the signal sequence witfi 
possible deleterious consequences for protein export. This optimized sequence was used in 
our previous construct R15/APPA and led to high levels of phytase production. We checked 
tiie APPA bacterial signal sequence using the PSORT computer neural networic trained on 
eukaryotic signal sequences and further described at http://psort,nibb.ac.jp:8800/ (Nakai and 
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Kanchisa 1992). The APPA bacterial signal sequence was recognized as an efficient leader 
peptide and the cleavage site was correctly predicted. PSORT also predicted that th^e Is a 
high probability that phytase would be exported correctly outside of the cell Th«re were also 
publications showing that some bacterial signal sequences migjit function efficiently in 
5 manunalian cells (Williamson et al 1994) (Hall et al 1990). Our experiments using cell 
culture demonstrated that the APPA signal was correctly processed with export of phytase 
outside of the cell. 

Experiments using cell culture cannot predict the direction of export and if phytase 
were exported into blood vessels instead of salivary ducts that could lead to deleterious 
10 effects. That is why we also designed a second construct PSP-signal/APPA (Figure 3, stq}S 
3b-7b) that would preserve the original PSP signal amino acid sequence: 

Met-Phe-Gbi-Uu-Gly-Ser-I^u-Val-Val-I^u-Cys-Gly-Leu-Leu-fle-Gly-Asn^ 



15 This leader peptide was also efficiently recognized by PSORT with the correct 

cleavage site (Nalcai and Kanehisa 1992). In this construct we also preserved the original 
PSP sequences near the ATG start codons, which may not be optimal, but could be important 
in regulation of gene expression. The APPA gene for both constructs was anq>lified by PGR 
using as the template our previous transgenic construct RIS/APPA that possessed the optimal 

20 Kozak sequence and the modified codons for residues AlaS, Pro428 and Ala429 as described 
earlier. For the APPA signal/APPA construct two synthetic primers w«re used wb^dlx 
introduced a Clal site near the ATG codon (APPA-CLA) and a I^nl site near flie TAA stop 
codon (APPA-KPN). The APPApcrI product was digested with Clal and JUpnl. The Clal 
site was also introduced into Lama 2 using pKS/Lama 2 as teaq>late for PCR. LAMA-UP 

25 primer was located upstream of ^al site and tiie LAMA-CLA primer introduced die Qal 
site near ATG codon (Figxirc 3, step 3a). Lamapcal product was digested with Clal and 
Apal (Figure 3, step 4a). pKS/Lama (Apal-Kpnl), LamapcRl (Apal- Clal) and APPApcrI 
(Clal-Kpnl) were combined together in a three-way ligation reaction (Figure 3, step 5a). 
The recovered pKS/Lama/APPA plasmid was digested with RsrII, Smal and inserted back 

30 into Lama2 (Figure 3 , step 6a). 

For the PSPsignal/ APPA construct, the synthetic APPA -KPN primer was used with 
the synthetic APPA -MATURE prirn^^ which produced phytase without a signal sequence. 
The APP Apcr2 product was blunt-ended using T4 DNA polymerase and digested vdth Kpnl . 
The PSP signal sequence was produced using the LAMA-UP and LAMA -SIGNAL primer 
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(Figure 3, step 3b). The LamapcaZ was blunt-ended using T4 DNA polymerase and digested 
with Apal (Figure 3, step 4b). pKS/Lama (Apal-I^nl), LamapcR2 (Apal-blunt) and 
APPAi»cr2 (blunt-Kpnl) were combined together in a three-way ligation reaction (Figure 3, 
step 5b). The recovered pKS/Lama/APPA plasmid was digested with Rsrll, Smal and 
5 inserted back into Lania2 (Figure 3, step 6b). 

Even though both constructs were successfully produced we decided to use 
Lama2/APPAsignal/APPA for the generation of transgenic mice, because we have results 
from our previous transgenic constructs R15/APPA and R15/APPA+intron which 
demonstrated that phytase with optimized Kozak sequence and the APPA signal peptide was 
10 synthesized at a high level in salivary glands after induction and was efficiently exported into 
the salivary duct. The Lama2/APPA vector was digested with Xhol and NotI, and the 
transgaie was geUpurified and furth^ purified using a Qiagen column (Figure 3, step 7a). 

2) Sequence of the Laraa2/APPA Construct 

A large segment of the Lama2 construct (Laursen and Hjorth 1997) used for 

15 construction of the Lama2-APPA transgene had not been reported in GenBank prior to our 
research. To ensure that we could more clearly describe the transgene construct, and 
fuithermore to avoid the introduction of deleterious DNA sequences from the mouse into the 
pig in the process of generating transgenic pigs, we sequenced the Lama2-APPA plasmid on 
both strands. Figure 4 illustrates schematically the structure of the Lama2-APPA plasmid. 

20 Figure 5 illustrates the nucleic acid sequence (SEQ ID NO: 1) of such plasmid The full 
transgene sequence was reconstructed fix>m overlapping DNA sequences using the Contig 
Assembly Program (CAP) flittp://hercules.tigenLit/ASSEMBLY/assembleAtmn developed 
by Huang ( 1996; 19S)9) and then inq>ected manuaUy for sequencing errors. The transgrae 
sequence was checked for the presence of interspeised iq>etitive elements using the computer 

25 program Rq>eatMa^cer (Smith and Green, RepeatMasker at 

ht^://fip.genome.washington.edu/c^-bin/Rq>eatMasker). It was found that 26 % of the 
transgene sequence was composed of repetitive elements (Table 2). However, such repetitive 
elmients are widely present in all mammalian genomes. For example, up to 50% of the 
human genome is derived from repetitive elements (Smit 1996; Kazazian 1998). 

30 Figure 23 illustrates the nucleic acid sequence (SEQ ID N0:7) of the Lama2/APPA 

transgene construct. 

The Lama2 high level expression cassette (Laursen and Hjorth 1997) contains the 
enhancer region and the promoter oTthe Psp gene in the parotid gland High expression ^'as 



RECTIFIED SHEET (RULE 91) 



wo 00/64247 



22 



PCT/CAOO/00430 



shown to be dependent on regulatory elements between -11.5 kb and -6,5 kb and/or between 
+8.3 kb and +10.9 kb. Svendsen et al. ( 1998a) showed that a 1.5 kb sequence between -3. 1 
kb and -4.6 kb had properties of a parotid and sublingual specific enhancer and was 
designated as the PSP proximal enhancer. Furthermore, they showed that transgcnes 
5 containing the PSP promoter and 5' flanking region located between —3.6 kb and -4.3 kb 
contained sequence information necessary to direct salivary gland specific expression. 

Screening the transgene with RcpeatMasker did not reveal the presence of any full- 
length active autonomous elements. The repeats present were extensively modified by 
inserdons and deletions. The btastx program was also used to compare the transgene 

10 sequence translated in all reading frames against the National Center for Biotechnology 
Information (NCBI) protein sequence database fhttp://www.ncbi.nlm.mh.gov/BLASTO 
(Altschul ei al. 1990;Gish and States 1993;Terada and Nakanuma 1993). A region of DNA 
from 861 to 2180 was found that might code for parts of a protein vntti limited homology 
(38<58% identities) to the C-terminus of several human and mouse reverse transcriptases. 

15 However, the region was extensively modified by mutations with multiple firame shifts and 
inversions, and probably represented renmants left from the reverse transcriptase gene of a 
LINE element It is unlikely that it would be active, due to extensive modifications in the 
amino acid sequence such that only 18% of the full reverse transcriptase sequence was 
present and the highly conserved amino acid motif (Y/FXDD) was absent fit>m the sequence 

20 (Xiong and Eickbush 1990). The complete sequence was also scanned for the presOTce of 
open reading frames (ORFs) that code for proteins usuig the program GENSCAN 
(httpy/CCR-08 1 >mit.cduyGENSCAN.html) (Burge and BCarlin 1997)- Only one gene was 
found and it corresponded to the APPA phytase gene. GENSCAN unexpectedly predicted a 
different N-terminus for the phytase than would have been expected Goom the sequence. 

25 However, that could have resulted fiom the lower accuracy of GEhlSCAN for detecting 
initiation sites (Burge and Kariin 1998). 

3) Generation of Transgenic Mice Expressing a Constitutive Salivary Phytase 

In the following description, a pair of founder mice, incorporating the phytase gene 
and a constitutive promoter, were prepared under contract by the University of Alabama. As 
^ 30 will be discussed, these founders were used to produce o£&pring, which were then analyzed 
for the presence of the phytase gene by PGR and animals containing the gene were thra 
tested constitutive salivary phytase production. 
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Two transgenic founder mice (a black male and a white female, 3-1) containing the 
phytase transgene were received from the NICHD Transgenic Mouse Development Facility 
at the University of Alabama. The black male was negative for salivary phytase, but the 
female, 3-1, exhibited a salivary phytase activity of 30 U/ml. Progeny produced by crossing 
5 the black male with 4 CD-I females produced 9 out of 25 females and 13 out of 26 males that 
were PGR positive. All progeny were negative for salivary phytase. The female founder, 3- 
1, was out-crossed with a CD-I male to produce 3 Utters for a total of 35 offspring. Of the 
progeny from these matings one phytase positive Gl male was obtained. When the Gl male 
was outcrossed with 6 CD-I females, of the 6 litters 20/34 males werePCP, positive and 
10 salivary phytase positive and 21/28 females were PCR positive and salivary phytase positive 
(Table 3). The salivary phytase activity of different offspring from the same first generation 
(01) male ranged from 1 .3 to 7L2 U/ml. There was no significant difference in the phytase 
activities between male or female mice. 

PCR assays for identification of the transgenic mice were carried out with an initial 
1 5 heating step at 95^C for 3 min, 40 cycles using 95°C for 30 sec, 54'*C for 30 sec and ll^'C for 
1 min) using the following primers: APPA-UP2 and APPA-KPN (Figure 6). 

The phytase assays were conducted as described above for the R15-PRP/APPA 
phytase expressed in cell culture. 

20 4) Production of Transgenic Pigs Containing the Phytase Traasgene Lama 2/APPA 

Transgenic pigs produced using Yorkshire and Yorkshire^Landrace ooss gilts as 
the embryo donors and Yorkshire sows as the recipients. The experimental procedure used 
was similar to ttiat described by Wall et aL ( 1985). The detailed procedure is described 
below. The Lama2/APPA construct with the APPA signal peptide was used as the transgene 

25 fornuccoiiyection. 

Methodolopv for the generation of transgenic pigs 

The following is a description of the preferred method of generating transgenic pigs 
according to the bvention. However, it will be apparent to those skilled in the art that 
various other methods are also i^plicable. 

30 

a) Superovulation of prepuberal gilts and sows. 

Selected Yoricshire or Yorkshire/Landrace cross gilts between 70 to 80 kg were 
superovulated by intramuscular injectioirof 2000 lU of pregnant mare's serum gonadotropin 
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(PMSG, Ayerst Veterinary Laboratories), followed by 700 lU human chorionic gonadotropin 
(HCG, Ayerst Veterinary Laboratories) 60 to 72 hours later, administered in the same 
manner. The gilts were artificially inseminated three times with a 16 hour interval between 
inseminations using semen from a high breeding index Yorkshire boar. Twenty-four hours 
after the last insemination, the gilts were slaughtered and the reproductive tract recovered. 

Synchronization of estrus in recipients 

Estrus was synchronized in experienced recipient sows as described for donor sows. 
Sboe synchronization and not superovuiation was the goal, hormone levels were reduced to 
500 lU for PSMG and 500 lU for HCG. PMSG was given the day the sow's litter was 
weaned, followed in 72 hours by HCG and surgwy for embryo transfix was performed 54 
hours thereafter. 

c^ Embryo collection 

Reproductive tracts w^ collected at the abattoir, inserted into bags, sealed and the 
bags immersed in water at 39'*C for transport to the laboratory. Recovery of the embryos and 
microinjection with the transgene was conducted in a laboratory maintained at 32 to 33**C. 
The oviducts wo-e dissected from the tracts and flushed, using a syringe and a feeding tube, 
with 15 ml of pre-warmed HBECM-3 medium (Dobrinsky et al 1996). The media was 
collected in a 100 mm Petri dish and placed in an incubator at ZZ.S^C with an atmosphere of 
5% (vol/vol) of CO2, 5% (vol/vol) O2 and the balance N2. After all tracts were flushed, 
embryos were individually collected from the flushed media using a polished transfer pipette. 
Embryos were rinsed twice in 3 ml volumes of pre-incubated BECM-3 and placed in 100 fil 
of pce-incubated BECM-3 under 3 ml of filter sterilized mineral oil until injected. 

Si Pronuclear injection 

Embryos from one gilt were collected and placed in one ml of pre-warmed HBECM-3 
in a 1.5 ml cratrifixge tube and centrifiiged for 6 min at 14,000 x g (Wall a/. 1985)- The 
embryos were then collected and placed in an injection dish with 40 jil of pre-warmed 
HBECM-3 covered with 2,5 ml of filter sterilized mineral oil. The pronucleus in each 
embryo was injected (Gordon ei al, 1980) with three picolitres of Lama2/APPA DNA in 
solution at a concentration of 5 ng of DNA per fil in 10 mM Tris, pH 7.5, 0. 1 mM EDTA. 
After injection, the embryos were placedi© dishes containing 100 \i\ of pre-incubated 
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BECM-3 under 3 ml of filter sterilized mineral oil. AAer all embryos were injected, which 
took no more than 4 hours since collection of reproductive tracts, the embryos were 
transferred to 1.8 ml cryotube (Nunc) containing 1 ml of pre-wanned HBECM-3 and 
transported in an incubator at 38.5°C to the swine surgery. 

5 

e) Embryo transfer 

Recipient sows were anesthetized by intravenous injection of 500 mg Brietol and 
anesthesia maintained by inhalation of 3% halothane with 4 litres per min of nitrous oxide 
and 4 litres per min oxygen. The oviducts were exposed through a laparotomy, just off the 
10 dorsal midline, and a catheter, containing 20 to 35 injected embryos and 3 to 6 untreated 

embryos, was passed into the infimdibulum and down the oviduct to the isthmus and raiptied. 
The oviduct was returned to the abdominal cavity and the incision closed. 

fl Growth of pies 

1 S New-bom piglets were kept together until weaning. At that time males and females 

were sqsarated and penned with non-transgenic same sex pigs of a similar age fix>m other 
litters. The pigs are fed libitum starts rations until 25 kg wt , grower diet fiom 25 to 60 
kg wt and finisher diet finom 60 kg to market weight Water is available ad libitum. 
Transgenic pigs 167-02, 282-02 and 282-04 were maintained on a low phytate ration until 85, 

20 50, and 50 days of age, respectively, and then switched to the grower ration. All other 
transgenic pigs were given the standard high phosphorus diets. 

The diets were provided as pelleted formulations during the weanling, grower and 
finishing phases are shown in Tables 4 and 5. The vitamin and mineral mixes included in the ' 
diets are shown m Tables 6 and 7. 

25 

PCR analysis 

Tail segments from newborn piglets were collected and slices of each placed in 600 (il 
of SO mM NaOH and heatmg at for 95^C for 15 minutes. The susprasion was neutralized 
with 50 fil of 1 M Tris (pH 8.0) and insoluble materials removed by centiifiigation for 5 min 
30 in a microcentrifuge. A 2 jil sample of each was used for PCR with primers APPA-UP2 and 
APPA-KPR 

The primers produce a 750 bp fiagment if the transgene is present As a positive 
control PIG-BGF and PIG-BGR primers were used to detect the porcine p*globin gene from 
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the same DNA preparation (Heneine and Switzer 1996). The PGR reaction was perfoimed 
using the same conditions as described for detection of the phytase transgene. As a negative 

control genomic DNA from a non-transgenic pig was used in the PGR reaction, for a positive 
control this DNA was spiked with a known amount of transgene (1 gene copy/per genome). 
5 When a positive signal was identified by PGR for pig 167-02 (Figure 3) another DNA 

preparation was made and two more pairs of PGR primers were used to test for gene integrity 
(Figure 4) APPA-MATURE with APPA-KPN, and APPA-MATURE with APPA-D0WN2 
PGR conditions were similar to those described previously, 

10 Extraction of DNA from blood for PGR analysis 

The method for extraction of DNA fix)m blood was based on a method described by 
Higuchi ( 1989) with some modifications. A 100 pi volume of whole blood was mixed with 
200 fil of lysis buffer (10 mM Tris-HGI, 0.32 M sucrose, 5 mM MgCh, 1% (vol/vol) Triton 
X-100, pH 7.50, mixed briefly and incubated on ice ifor 5 min- The sample was then 

1 5 cmtrifuged at 1 4,000 x G for 3 min, and the supemate discarded. The sediment was 

suspended in lysis buffer, mixed, incubated and centrifuged. This procedure was repeated 2 
more times, or until no hemoglobin remained- The sediment was dissociated in 100 fd of 50 
mM NaOH, mixed and heated at lOO^^G for 10 min. The contents were cooled, 10 jd of 1 M 
Tris-HGl (pH 8,5) added and mixed briefly. The sample was then centrifiiged at 14,000 x g 

20 for 2 min and 2 \xl of the supemate used for analysis by PGR 

The PGR reaction mixture with a total volume of 40 ^1 consisted oft 23,8 \i\ of 
distiUed water, 4 of 10 X Gibco BRL PGR buffer, 1 .2 of 50 mM MgGli, 0,8 jd of 10 
mM dNTPs, 40 pmol of eadi of the forward and reverse primers in 8 ^1, 2 ^1 of template 
DNA and 02 ^1 of Taq DNA polymerase (Gibco BRL, 5 U/fd). The amplification procedure 

25 was performed with an initial heating step at 95**C for 3 min followed by 40 cycles of 95*C 
for 30 sec. 54**C for 30 sec and 72*C for 60 sec. 

The transgenic pigs were detected with primers for the ilPP^ gene (APPA-KPN with 
APPA-UP2), and as a control PIG-BGF with PIG-BGR primers were used for detection of 
the porcine P-giobin gene. 

30 

Saliva collection fiom pigs for ohvtase assays and weighing of pigs 

Weanling pigs were sampled for salivary phytase by wiping under the tongue with a 
cotton tipped applicator, breaking the stick offmid centrifiiging the 2q}plicator tip in a 0.4 ml 
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micFOcetitrifuge tube, with a hole in the bottom, contained within a 1.5 ml miciocentriiuge 
tube. Grower and finishing pigs were sampled using 1.5 inch long #2 dental cotton absorbent 
rolls (Ash Temple Sundries Ltd, Don Mills, ON) attached to dental floss. These were 
centrifiiged in 1.5 ml microcentrifuge tubes with holes in the bottom while contained in larger 
5 tubes. The saliva was collected from the larger tube and stored at ~20'*C until analyzed. 
Saliva was collected and pigs were weighed at weekly intervals. 

Analysis for phvtase activitv. 

Saliva samples were either assayed directly or after dilution in O.i M acetate buffer 

10 pH 4.5, Phytase was assayed in 200 \il of 0.1 M sodium acetate buffer (pH 4.5) using sodium 
phytate (4 mM) as a substrate at 37^C. After 10 min of incubation the reaction was stopped 
by addition of 133 ^1 ammonium molybdate/ammonium vanadate/nitric acid mixture and the 
concratration of liberated inorganic phosphate determined at 405 nm (Engelen, van der 
Heeft, Randsdorp, and Smit 1994). This and all other assays were performed in triplicate. 

15 One unit (U) of enzyme activity was the amount of the enzyme releasing 1 fxmol of inorganic 
phosphate per minute. 

Assays for salivary phytase and for phytase in blood samples were conducted as 
previously described for saliva samples. A reagent blank with blood added at the same 
concentration as the samples assayed was subtracted from the sample readings. 

20 

Collection of fecal materials and analysis for total phosphoms 

Fresh feces were collected from each pig during the grower and finisher phases. 
Samples were placed in aluminum trays closed with a wax p^er top and immediately frozen, 
and kept frozen until they were lyophilized for analysis. After lyophilization the samples 

25 wexie transferred to n>om conditions ovenuglit to reach equilibrium in mo The 
samples were sq>arately ground with a mortar and pestle until homogenous and sealed in 
plastic containers until analyzed ftutfaer. Dry matter content of samples was analyzed 
according to AOAC (Association of Official Analytical Chraiists (AOAC) 1984) by heating 
1 gram samples at 1 lO^C for 4 hours and cooling in a desiccator prior to weighing. To 

30 analyze total phosphorus content, samples were heated at 550**C in a muffle furnace and 10 
ml of 10 M HCl added and heated to boiling. The contents from each sample was 
quantitatively diluted to 250 ml with water and inorganic phosphorus content was measured 
by the method of Heinoen and Lahti (1981). 
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Purification of the E. coli produced phvtase and pig salivary phvtase 

The APPA phytasc was over expressed in E. coli strain BL2l(DE3) and the EDTA 
lysozyme extract fraction purified on DEAE-Sepharose and Sephadex-G75 as described by 
5 Jia et aL ( 1998). The pig phytase was purified by chromatography on DEAE-Sepharose and 
the band of enzyme eluted with a sodium chloride gradient was further purified by 
Chromatofocusing using a pH gradient from pH 4,0 to 7.0. 

SDS'PAGE analysis and Silver Staining 
10 Sodium dodecylsulfate polyacrylamide gel electrophoresis was performed using a 

10% gel as described by Laenmili ( 1970), except that protein in the sample buffer was 
heated at 70^C for 10 minutes. Samples w^e stained with silver as described by Nester^o 
etal.(1994). 

15 Preparation of a monoclonal antibody specific for the APPA encoded E. coli phytase 

Monoclonal antibodies spedfic to the E. coli APPA encoded phytase w^e prq>ared 
according to the procedures of Galfre and Milstein (1981). Briefly, two female Balb/c mice 
were immunized 7 times over a period of 59 days with a purified APPA enzyme preparation. 
Mouse spleens were harvested, and the cells therem fiised with an NS-1 myeloma cell line 

20 (Kohler and Milstein, 1976). Fused cells were selected for their ability to grow in media 

containing hypoxanthine, aminopterin, and thymidine (HAT). Western blotting and Enzyme- 
linked Immunosoibent Assays (ELIS A) were used identify those clones capable of secreting 
an antibody into the culture medium ttiat recognized q)itopes on both the E. coli and pig 
derived APPA enzyme. Clones secreting a desirable antibody were subcloned twice to 

25 ensure a pure culture of antibody secreting hybridomas. 

Production of Polyclonal Antibodies Against the Purified E, coli derived APPA Phytase 

Antibodies were prepared m two New Zealand White Rabbits by two intramuscular 
injections at difTorat sites in the thigh of 50 jig of purified Escherichia coli dOTved APPA 
30 phytase in 0.5 ml of a 1:1 mixture of phosphate-buffered saline (PBS) and Freund's Con^)lete 
Adjuvant This wasfoUowedbyrepeatinjectionsof20iigeachofphytaseina 1:1 ntixture 
of PBS and Freund's Incomplete Adjuvant on days 4, 19, 25, and 39. Blood was collected 
via heart puncture on day 42. The scrum was separated firora the cell firaction and used as the 
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source of antibodies. The basic procedures for antibody production are described in Harlow 
and Lane (1988), 

Western blotting 

Western blotting was performed as described by Towbin et al. (Towbin et al 1979). 

Deglycosylatioa of pig phytase was done according to protocols, Roche Molecular 
Biochemicals, with following modifications. Protein in 50 mM Tris (pH 8.0), 1 mM EDTA, 
1% SDS, 1%2-mcrcaptoethanol was denaturated by heating at 95** C for 3 nun. Than protein 
was precipitated with chlorofomi-methanol method (Wessel and Flugge 1984) and 
resuspcndcd at 100 pg/mL in 20 mM Sodium Phosphate (pH 7.2) with 1% Triton X-100. 
Complete deglycos^ation of 5 jig in 50 phytase was carried out overnight at 37^C using 
1 unit (U) N-glycosidase F, 1.2 mU O- glycosidase and 1 mU neuraminidase (Boehringer 
Mannheim GmbH). After incubation 0.5 jig of protein was run on the SDS gel. 

Staining of glycoproteins 

This staining was done using DIG Glycan Detection Kit (Boehringer Mannheim) 
according to manufacture instructions (O'Shannessy et al, 1987), 

Statistics on flie generation of transgenic pips 

The statistics on embryos recovered, microinjected and transfen-ed into donor sows is 
shown in Table 8. A total of 4147 embryos injected with the transgoie and 675 untreated 
embryos were introduced into 140 recipient sows with an average of 30 injected embryos and 
5 uninjected embryos. All offspring were tested for the prescaice of the transgene in tissue 
biopsy, in blood by PGR analyas, and by an assay for phytase activity in the saliva. 

Table 9 lists tte transgenic pigs that woe produced, ttidr biith dates, sex and salivary 
phytase levels. There were 3 1 pigs transg^c for the phytase gene out of 203 live piglets 
bom fiom embryos microinjected; These were detected by the presence of the gene in blood 
samples using die standard primw set, APPA-UP2 and APPA -KPN, but onfy 14 were 
detected by analysis of tail DNA preparations using the standard prima- set. When the 
negative samples were reanalyzed using the primer set LAMA-UPl and APPA--down4 
(Figure 8) a further 8 tail DNA samples were found to be positive. Purification of the tail 
biopsy DNA probably would have led to all being PGR positive for the phytase transgrae. 
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Characteri stics of the phvtase transgene in transgenic pig 167-02 

The application of PGR to detection of transgenic pigs is exemplified by analysis of 
litter 167 in which one of 7 piglets tested, including one that was stillborn and one that was 
crushed by the sow after birth, one live piglet designated 167^2 was identified as positive for 
5 the APPA gene by generation of a PGR product (Lane 2) of approximately 750 bps from the 
tail chromosomal DNA (Figure 7). No reairangements of the APPA gene were detected as 
documented by the positive PGR results using primers directed to the 3* region (lane 2) the 
whole gene (lane 3) and the 5' region (lane 4) of the APPA gene (Figure 8). 

^0 Salivary phytase and weight gain durin g growth of transgenic and non-transgenic penmates. 

Data on salivary phytase activity and weight gain are shown for five tiansgOTic pigs 
and for weight gains of their non-transgenic paimates in Figures 9, 10, 1 1, 12 and 13. The 
phytase activity in the saliva varied substantially from one sampling time to the next. This 
variability was attributed to a combination of environmental factors including whether the 
15 animal had just consumed food or water, and regulation of parotid and saliva secretion in 
relation to food and water consumption. The weigjit gains during growth of the five 
transgenic pigs was within the range of the weigjit gains of the normal non-transgenic pigs. 

With the exception of 167-02 the growth rate of the transgenic pigs was similar to that 
of the noa*transgenic litter mates. 

^® Phosphoms content in the fecal matmals from transgenic and non-transgenic nigs. 

The phosphorus content of finesh fecal samples 6om three of the transgenic founder 
pigs, 167-02, 282-02, 282-04, 405-02 and 421^)6 receiving weaning, grower or finisher 
ration is shown in Table 9. The phosphorus content of the feces of tibe transgwiic pigs ranged 
from L59 to 226Y^ while that of the non-transgenic penmates ranged fiom 1.61 to 2.76 %, 

25 The reduction in fecal phosphorus ranged fifom a maximum of 26% to a mmimum of 8%. In 
most cases the differences were at the99% level of significance. Tlie ages of the pigs at the 
time of fecal sampling and the corresponding phytase activities are shown in Figures 9, 10, 
1 1, 12 & 13. The rations fed contained a supplement of readily available phosphorus suitable 
for maximizing growth of non-transgenic pigs. Since the reduction in fecal phosphoixis is 

30 measured in transgenic pigs receiving a diet high in mineral phosphorus it is very likely that 
the fecal phosphonis would be substantially lower if the diet lacked mineral phosphorus. 
Under these conditions the phosphorus released fiom phytate would provide a substantial 
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proportion of the dietary phosphorus and little would reach the large intestine and be excreted 
in the feces. 

Transmission of the phvtase transecne (to be completed) 
5 When semen from the transgenic boar 167-02 was used to inseminate four Yorkshire 

gilts all four sows had litters in which 4 out of 8, 2 out of 9, 7 out of 8 and 2 out of 5 of the 
piglets were transgenic for the phytase gene (Table 1 1). The PGR data for litter 154 that 
documents the presence of the transgene is shown in Figure 14, All pigs containing the gene 
exhibited phytase activitjr in the saliva, and it ranged from 341 to 10,077 units per ml. Half 
10 of the transgenic piglets had salivary phytase activities of greater than 2000 units per ml. The 
specific activity of the phytase in the saliva ranged from 39 U/mg protehi to a hi^ of 706 
units/mg proteiit 

This data documents that the g^e was transferred and that the level of phytase 
expression observed in the founder was preserved in the first generation of pigs. Both male 
1 5 and female pigs at 1 1 days of age exhibited high phytase activity. 

Characteristics of the ohvtase enzyme synthesized in the salivary glands of the pig 

The phytase enzyme was purified to homogeneity fix>m R coli and from saliva 
collected from transgenic pig 167-02. Silv^ stains of the purified enzymes after SDS-PAGE 

20 are shown in Figure, 15. The E. coli demcd enzyme has a molecular mass of approximately 
45 M)a while fliat produced by the pig was about 55 kDa. The enzymes were also 
electrophoresed as before, transferred to nitrocellulose and stained for glycoproteins. The 
second part of Figure 15 shows that the pig APPA protein is glycosylated. Figure 15B shows 
that treatment of the pig phytase with deglycosyiation enzymes changes the size of the 

25 phytase fix)m 60 kDa to 45 kDa, an observation that confirms the glycosylated nature of the 
recombinant phytase produced in the saliva of ttie pig. 

The data in Figure 16 shows that the pig phytase is homologous with the £, Coli 
enzyme despite their difference in size. 

The purified pig phytase had and Vmgx values of 0.33 mM and 624 units per mg of 

30 protein, respectively, Golovan et aL (2000) previously reported the K„, and for the £. 
coli enzyme to be 0.63 mM and 2325 units per mg of protein. Thus the salivary phytase 
exhibits approxunately 25% of the activity of the E. coli enzyme. This reduction in activity 
may be due to glycosylation that either modifies the catalytic site of the enzyme or otherwise 
leads to the formation of an enzyme with lower catalytic activity. 
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The latter finding of the production of a jglycosjdated protein suggests a methoa*of 
producing such proteins using transgenic animals. Currently, although recombinant methods 
are available for producing proteins in host cells, it is often found that the mature peptide 
lacks the glycosylation normally associated with proteins produced by higher life forms. 
5 Insulin is an example of such protein. The findings of this study suggest that one means of 
producing the desired glycoproteins would be to genwate transgenic animals such as the pig, 
that have be^ transfonned, by known methods or the method described above, with a gene 
encoding the desired protein. When expressed by such animal, the subject protein would be 
produced and would undergo post-translational processing in the cell including the step of 

10 glycosylation. Thus, the invention contemplates a general method of producing such 
glycosylated proteins. Further, the invention contCTq)lates a method of producing 
glycosylated proteins through the expression m and isolation fi-om the saliva of an animal that 
has been transformed with a gene encoding such protein, and wherein such gene is operably 
linked to a saliva protein promoter or enhancer. 

15 Various methods are known in the art for the collection of glycoproteins ftom the 

parotid gland of the pig for various applications. For example, surgical techniques have been 
published by Desmy et al. (1972) for the collection of secretions fiom the parotid gland and 
submandibular salivary ducts. 

20 Test kit for detection of the APPA phvtase protein in nies 

The monoclonal antibodies produced against the APPA phytase expressed in R coli 
reacted with the APPA phytases produced in the saliva of transgenic mice and pigs (Figure 
17). Immimological detection of phytase in saliva provides definitive proof that the phytase 
secreted in transgenic pig saliva is a product of the APPA gene expressed in the pig salivary 
25 gland. This serves as a reliable method to document phytase production in transgenic pigs. 

A fiuther test would also be obtainable using the polyclonal antibodies discussed 

above. 

The DNA sequence encoding phytase may be obtained from a variety of sources such 
30 as microbial, plant or animal sources. Preferably, the DNA sequence is obtained fiom a 
microbial source such as bacteria. Most preferred DNA sequmces are obtained fi^m 
Escherichia coli. 

The cloning of a gene or a cDNA encoding a phytase protein may be achieved using 
various methods. One method is by purification of the phytase protein, subsequent 
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determination of the N-tenninal and several internal amino acid sequences and screening of a 
genomic or cDNA library of the organism producing the phytase using oligonucleotide 
probes based on the amino acid sequences. If at least a partial sequence of the gene is known, 
this infomiation may be used to clone the corresponding cDNA using, for instance, the 

5 polymerase chain reaction (PGR) (PGR Technology: Principles and Applications for DNA 
Amplification, (1989) H. A. Ehrlich, ed., Stockton Press, New York; the contents of which 
are incorporated herein by reference). It will be evident to those skilled in the art that the 
cloned phytase gene described above may be used in heterologous hybridization experiments, 
directed to the isolation of phytase encoding genes from other microorganisms. 

10 The DNAs encoding phytase or individual fiagments or modified proteins thereof can 

be fiised, in proper reading fi^e, with appropriate regulatory signal$ as described in detail 
below, to produce a genetic construct that is then amplified, for exaixq>le, by preparation in a 
bacterial (e,g., £, coli) plasmid vector according to conventional methods. Such methods are 
described in, for example, Sambrook et aL, Molecular Cloning: A Laboratory Manual (Cold 

15 Spring Harbor Press 1989), the contents of which are incorporated h^ein by reference. The 
amplified construct is fliereafter excised firom the vector and purified for use in producmg 
transgenic animals. 

The desired protein may also be produced as a fiision protein containing anotfa^ 
protein. For example, the desired recombinant protein of this invention may be produced as 

20 part of a larg«- recombinant protein in order to stabilize the desired protem. Useful 
modifications within this context include, but are not limited to, those that alter post- 
translational modifications, size or active site, or that fiise the protein or portions thereof to 
another protein. Such modifications can be introduced into the protein by techniques well 
known in this art, such as by synthesizing modified genes by ligation of overlapping 

25 oligonucleotides or introdudng mutations into the cloned genes by, for example, 
oligonucleotideHnediated mutagenesis. 

The cloned phytase gene may be used as starting materials for the construction of 
improved phytases. In^)n)ved phytases are phytases, altered by mutagenesis techruques (e.g. 
site-directed mutagenesis, or directed evolution), which have properties that dififer firom those 

30 of wild-type phytases (Kuchner and Arnold 1997). For example, die teanpOTture or pH 
optimum, specific activity, temperature or protease resistance may be altered so as to be 
better suited for a particular application. 

A choice of expression in cellular compartments (such as cytosol, endoplasmic 
reticulum) or extracellular expression can be used m die present invention, depending on the 
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biophysical and biochemical properties of the phytase. Such properties include, but are not 
limited to pH sensitivity, sensitivity to proteases, and sensitivity to the ionic strength of the 
preferred compartment. The DNA sequence encoding the enzyme of interest should be 
modified in such a way that the enzyme can exert its action at the desired location in the cell. 
To achieve extracellular expression of the phytase. the expression construct of the present 
invention utilizes a bacterial signal sequence. Althou^ signal sequences that are 
homologous (native) to the animal host species are preferred, heterologous signal sequences, 
i.e. those originating fiom other animal species or of microbial origin, may be used as weU. 
Such signal sequwices are known to ftose sldlled in the art 

All parts of the relevant DNA constructs (promoters, regulatory, secretory, stabilizing, 
targeting, or termination sequences) of the preseat invention may be modified, if desired, to 
affect flicir control charactedstics using methods known to (hose sldlled in the art. The cis- 
acting regulatory regions useful in the invention include the promote that drives expression 
of the phytase gene. Highly preferred are promoters that are specifically active in salivary 
gland cells. Among such promoters, highly preferred are mouse parotid secretory protein 
(PSP) promoter, rat proline-rich protein (PRP) promoter, human salivary amylase promoter, 
mouse mammary tumor virus promoter (Samuelson 1996). Among the usefixl sequences that 
regulate transcription, in addition to the promotes discussed above, are oihancers. splice 
signals, transcription termination signals, and polyadenylation sites. Particularly useful in 
this regard are tiiose that increase the efBciency of the transcription of the genes for phytase 
in the salivary gland or other cells of the transgraiic animals listed above. Preferred are 
transcription regulatory sequences for proteins highly expressed in the salivary gland cells. 
Introns could be introduced to increase levels of expression. Such introns include the 
synthetic intnm SIS. SV40 smaU t antigen intron and others (Whitelaw et aL 1991; Petitclerc 
etal, 1995). 

Preferably, the eiqpression system or construct of ftis invention also includes a 3' 
untranslated region downstream of the DNA sequence encoding the desired recombinant 
protein, or the saUvary protwn gene used for regulation. This region apparently stabilizes the 
RNA transcript of the expression system and thus increases the yield of the desired protein. 
Among the V untranslated regions useful in this regard are sequences that provide a polyA 
signal. Such sequences may be derived, e.g.. fiom the S V 40 small t antigen late 
polyadenylation signal, synthetic polyadenylation signal or other 3' untranslated sequences 
well known in this art (Carswell and Alwine l989;Levitt et al 1989). Preferably, the 3' 
untranslated re^on is derived from a salivary-specific protein. The stabilizing effect of fids 
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region's polyA transcript is important in stabilizing the mRNA of the expression sequence. 
Further, the addition of locus control regions (LCRs), matrix attachment regions (MAR) and 
scaffold attachment regions (S ARs) would allow position-independent, copy number 
dependent expression of the transgene with either homologous or heterologous promoters 
5 (Taboit-Dameron et al, 1999;Geyer 1997). Co-mtegration of an actively expressed gene with 
the transgene was also shown to increase expression levels of a poorly expressed transgene 
(Clark et al. 1993). Also important in increasing the efiSciency of expression of phytase is a 
strong translation initiation site (Kozak 1987). Likewise, sequences that regulate the post- 
translational modification of phytase may be useful in the invention- 

10 The term "animal" as used herein denotes all animals except humans. It also includes 

an individual animal in all stages of development, including embryonic and fetal stages. 

A ^'transgenic*' animal is any animal containing cells that bear gmsAc infoimation 
received, directly or indirectly, by deliberate genetic manipulation at tiie subcellular level, 
such as by microinjection or infection with a recombmant virus. "Transgenic" in the present 

1 5 context does not encompass classical crossbreeding or in vitro fi^lization, but rather denotes 
animals in which one or more cells receive a recombinant DNA molecule. Although it is 
highly preferred that this molecule be integrated within the animal's chromosomes, the 
invention also encompasses the use of extrachromosomally replicating DNA sequences, such 
as might be engineered into yeast artificial chromosomes. The inforrbation to be introduced 

20 into the animal may be foreign to the species of the animal to which the recipient belongs 
C-c., heterologous"), or the information may be foreign only to the particidar individual 
icdpicnt, or genetic information akeady possessed by the recipient ti the last case, the 
introduced gene may be expressed in a maimer different than the native gene. 

As indicated above, die transgenic animals of this invention are other than human. 

25 Farm animals (pigs, goats, sheep, cows, horses, rabbits and the Uke), rodents (such as mice 
and rats), domestic pets (eg. cats and dogs), fish and poultry (eg. chickens) are included in the 
scope of this invention. It is hi^y preferred that a transgenic animal of the present invention 
be produced by introducing into single cell embryos impropriate polynucleotides that encode 
phytase, or fragments or modified products thereof; in a manner such that these 

30 polynucleotides are stably integrated into the DNA of geim line cells of the mature animal, 
and are inherited in normal mendeiian fa^ion. Advances in technologies for embryo 
micromanipulation now permit introduction of heterologous DNA into fertilized mammalian 
ova. For instance, totipotent or pluripotent stem cells can be transformed by microinjection, 
calcium phosphate mediated precipitation, liposome fiision, retroviral infection or other 
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means, the transformed cells are then introduced into the embryo, and the embryo then 
develops into a transgenic animal. In one preferred method, developing embryos are infected 
with a retrovirus containing the desired DNA, and transgenic animals produced from the 
infected embryo. In a most preferred method, however, the appropriate DNAs are co-injected 
5 into the pronucleus or cytoplasm of embryos, preferably at the single cell stage, and the 

embryos allowed to develop into mature transgenic animals. Such techniques are well known 
(see reviews of standard laboratory procedures for microinjection of heterologous DNAs into 
mammalian fertilized ova, including Hogan ct al., Manipulating The Mouse Embryo, (Cold 
Spring Harbor Press 1986): Krimpcnfort et al., Bio/Technology 9:844 (1991); Paimiter et ai., 

10 , Cell, 41: 343 (1985); Kraemer et al,. Genetic Manipulation Of The Early Mammalian 
Embryo, (Cold Spring Harbor Laboratory Press 1985); Hammer et al., Nature, 315: 680 
(1985); Wagner et al., U.S. PaL No. 5,175,385; Krimpcnfort et al., U.S. Pat. No, 5,175,384, 
the respective contmts of which are incorporated herein by reference). 

For a person skilled in ait, it will also be clear that the preset invention provides for 

1 S ottier protdns to be expressed in the salivary gland of the pig. Such proteins may be secreted 
into saliva to improve digestion and decrease pollution potential (for example, 
endoglucanases), or q>e(nfically targeted for secretion into blood and have effects on the 
growdi and health of the animal (such as growth hormone). 

Phytase activity may be measured via a nimiber of assays, the choice of which is not 

20 critical to the present invention. For example, the phytase enzyme activity of the transgenic 
animal tissue may be tested with an ELISA-assay, Western blotting or direct enzyme assays 
using calorimetric techniques or gel assay system. 

The examples included herein are provided so as to give those of ordinary skill in the 
art a complete disclosure and description of how to make and use the invention and are not 

25 intended to limit the scope of what the inventors regard as ttieir invcntioiL Efforts have been 
made to ensure accuracy with respect to nimibers used (e.g., amounts, temperature, pH, etc.) 
but some experimental errors and deviation should be accounted fon Unless indicated 
otherwise, temperature is in degrees Centigrade and pressure is at or near atmospheric, 

30 

Althougji the invention has been described with reference to certain specific 
embodiments, various modifications thereof will be apparent to those skilled in the art 
without departing fi:om the spirit and scope of the invention as outlined in the claims 
appended hereto. 
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Table 1. Secretion of phvtase in the saliva of transgenic mice containing the R1S»PRP/SPPA 
transgene and non-transgenic mice induced with isoproterenol and pilocarpine. 



Founder Mice PGR Gender Generation Transgene Phytase activity 

micromoles/min/ml 



AOm 


4bfr(+) 


positive 


F 


1 


APPA+intoon 


39.73 


AOm 


2brm(+) 


positive 


M 


I 


APPA+intoon 


24.29 


AOm 


2bnn(+) 


positive 


M 


2 


APPA+intron 


14.42 


AOm 


5brf(+) 


positive 


F 


2 


APPA+intion 


7.36 


AOm 


lbnn(-) 


negative 


M 


1 


APPA+intron 


0.00 


Alf 


9brf(+) 


positive 


F 


1 


APPA+intron 


0.08 


Alf 


llw£(+) 


positive 


F 


1 


APPA+introa 


0.07 


Alf 


5bnn(+) 


positive 


M 


1 


APPA+intron 


0.03 


Alf 


10wf(-) 


negative 


F 


1 


APPA+intron 


0.02 


A20f 


lbnn(+) 


positive 


M 


1 


APPA+intron 


0.53 


A20f 


5brf(+) 


positive 


F 


1 


APPA+intron 


0.12 


A20f 


4brf(-) 


negative 


F 


1 


APPA+intion 


0.03 


A2m 


13wf(+) 


positive 


F 


1 


APPA+intron 


87.70 


BOm 


5btf(+) 


positive 


F 


1 


APPA+intron 


0.95 


BOm 


3brm(+) 


positive 


M 


1 


APPA+intron 


0.73 


BOm 


6wf(-) 


negative 


F 


1 


APPA+intron 


0.00 


BOf 


3wf(+) 


positive 


F 


2 


APPA 


252.43 


BOm-intr 9w^+) 


positive 


F 


1 


APPA 


546.74 


WOm 


8wfl[+) 


positive 


F 


1 


APPA 


60.42 


W30m 


lwin(+) 


positive 


M 


2 


APPA 


41.91 


W30m 


llwfl[+) 


positive 


F 


1 


APPA 


43.44 


W30m 


4wm(-) 


negative 


M 


1 


APPA 


0.02 


W30m 


lOwfO 


negative 


F 


I 


APPA 


0.02 
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Table 2. Repeat sequences found in the Laina2-APPA consU uui. 



Start 


aid 


DNA 
strand 


Repeat 


Class/^mily 


Substitu- 
tions % of 
consensus 


Deletions 

%of 
consensus 


Insertions - 

%of 
consensus 


765 


927 


-r 








4 2 


6.7 


928 


965 


+ 




oin^ic repeal 




n 


0 


966 


1020 


+ 


LlMl 


LINE/Ll 


25 


4.2 


6.7 


1021 


1156 




Bl MM 


SINE/Alu 




A 
U 


V 


1159 


1231 




CAAAC)n 


Simple repeat 


1 A 

1.4 


A 
U 


n 

V 


1232 


1385 




LlMl 


LINE/Ll 


25 


4.2 


6.7 


1652 


2308 


c 


LI 


LINE/Ll 


28«5 


li.v 


i./ 


2334 


2406 


c 


MIR 


SINE/MIR 


27.4 


A 1 

4.1 


A 
U 


2415 


3266 


+ 


RMER13A 


LTR 


17.7 


4 


4^ 1 
O.i 


6016 


6127 


c 


L1MA9 


LINE/Ll 


25.5 


2 


1 

I 


6831 


7007 


+ 


CT-rich 


Low 
complcTdty 


30.5 


1.7 


3.4 


7299 


7510 


c 


B3 


SINE/B2 


27.8 


7.5 


1.4 


7718 


7746 


+ 


(TCTCTG)n 


Simple repeat 


6.9 


0 


A 

U 


8499 


8581 


c 


MIR 


SINE/MIR 


24.1 


12.1 


3.0 


oniA 






Lx4 


LINE/Ll 


21.7 


6.4 


02 


10465 


10519 


+ 


(TG)n 


Simple repeat 


5.5 


1.8 


0 


11235 


11287 


c 


MER5A 


DNA/MERl 
type 


28.3 


0 


1.9 


12372 


12537 


c 


L1MA4A 


LINE/Ll 


28.3 


5.4 


0 


14240 


14388 


+ 


B1_MM 


SINE/AIu 


4 


0 


13 


14869 


14945 


c 


MIR 


SINE/MIR 


36.4 


1.3 


0 


16391 


16540 


c 


ORRID 


LTR/MaLR 


293 


0 


6 


16774 


17214 


+ 


RMER4 


LTR 


21J 


10 


11.8 


17229 


17718 


c 


LI MM 


LINE/Ll 


15.3 


0 


0.8 
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Table 3. Salivary phvtase activities of G2 mice from the founder female 3-1 
generated using the construct Lama2-APPA- The mice were between 21 and 30 
days of age. 



male mouse # 


Phytase (U/ml) 


female mouse # 


Phytase (U/ml) 


5 


28.3 


I 


9.0 


6 


2.5 


2 


29.9 


8 


6.6 


4 


8.0 


9 


44.7 


5 


43.0 


1 A 

I. \J 


12.7 


/' 
0 


26.9 


12 


28.3 


S 


1.9 


15 


28.1 


9 


66.3 


18 


71^ 


10 


19.9 


19 


19.5 


11 


61.3 


20 


15.7 


12 


36.4 


21 


20.9 


13 


18.0 


22 


4.1 


17 


38.9 


24 


13.0 


18 


18.5 


26 


53.4 


19 


27.0 


28 


20.4 


23 


6.5 


29 


34.1 


24 


16.1 


30 


11.1 


25 


9,4 


32 


3.1 


26 


14.8 


33 


51.7 


27 


1.3 


34 


19.0 


28 


8.2 
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Table 4> Composition and nutrient levels of Phase II starter diet and low phvtatc 
starter diets fed to weanling pies between 5-10 kg. 



Ingredients 


Diet/Nutiieat Levels' 




Phase n Starter Diet 


Low Phytate Starter Diet 


Com 


33.15 


25.44 


Barley 


8.00 


8.00 


Wheat 


20.00 


40.00 


Soybean meal 


21.00 


8.00 


Fish meal 


5.00 


5.00 


Meat and bone meal 


- 


1.00 


Whey 


8.00 


8.00 


Fat 


2.00 


2.00 


Lysine-HCl 


0.10 


0.28 


Dicalcium phosphate 


I.IO 


- 


CaCCb 


0.90 


1.10 


Iodized salt 


0.30 


0.30 


Vitandn premix* 


0.250 


0.55 


Mineral premix' 


0.10 


0.10 


Lincommix44 


0.10 


0.10 


Total (kg) 


100.00 


100.00 








Calculated nutritive values 






DE (kcal/g) 


3.44 


3,36 


CP (%) 


19.46 


18.62 


Ca(%) 


l.OO 


0.94 


Total P(%) 


0.74 


0.66 


Ca/P 


1.35:1 


1.42:1 


Total AA contents (%) 






Arginine 


1.16 


1.17 


Histidine 


0.50 


0.48 


Isoleucine 


0.81 


A 77 




1.58 


1.54 


Lysine 


1.17 


1.06 


Methionine 


0J4 


0.29 


Cystdne 


034 


034 


Mcthionine+Cystdne 


0.68 


0.63 


Phenylalanine 


0.90 


0.90 


Tyrosine 


0.65 


0.65 


Threonine 


0.75 


0.68 


Tryptophan 


0.23 


0.23 


Valine 


0.91 


0.86 



Minerals and vitamins meet or exceed levels recommended by NRC (1998). 
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Table 5. Composition and nutrient levels of grower and finisher diets. 



Ingredients 


Diet/Nutrient Levels 




Grower Diet 
For pigs 20 to 50 kg 


Finishing Diet 
Forpigs50tol20kg 


Cora 


5L78 


40.00 


Barley 


8.10 


23.03 


Wheat 


20.00 


23.00 


Soybean meal 


16.00 


13.00 


Fat 


1. 00 


1 r\r\ 


Lysinc-HCi 


0.12 


0.12 


Dicalcium phosphate 


1.20 


1.00 


CaCOa 


1 15 


1.15 


Iodized salt 


0 50 


0.50 


Vitamin pretnix' 


0 1 s 


0.15 


Mineral oremix' 


0 10 

V. XV/ 


0.10 


Total flee) 


inn no 


100.05 








Calculated nutritive values 






DE fkcal/c) 


3 39 


3.33 


CP (%) 


14.76 


14.17 


Ca(Vo) 


0.79 


0.74 


Total P (%) 


0.57 


0.53 


CayT? 


1.39:1 


1.39:1 


Total AA contents (%) 






Arginine 


0.86 


0.80 


Histidine 


038 


0.36 


Isoleucuie 


0J)8 


0.55 


Leucine 


1.28 


1.18 


Lysine 


0.78 


0.73 


Methionine 


0.24 


023 


Cysteine 


0.29 


0.29 


Methioninc+Cystcine 


0.53 


0.52 


Phen^dalanine 


0.70 


0.68 


Tyrosine 


0.50 


0.46 


Threonine 


0.52 


0.49 


Tryptophan 


0.17 


0.16 


Valine 


0.68 


0.65 



'Minerals and vitamins meet or exceed levels recommended by NRC (1998). 
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Table 6. Vitamin premix composition 



Nutrient 



Amount per S kg of premix 



Wheat midds 
Vitamin A 
Vitamin D 
Vitamin E 
Menadione 
Pantothenic acid 
Riboflavin 
Folic acid 
Niacin 
Thiamin 
Pyridoxine 
Vitamin Bi2 
Biotin 
Choline 



3,867 kg 
10 million lU 
1 million lU 
40 thousand lU 
2.5 g 
15 g 
5g 
2g 
25 g 
1.5 g 
1.5 g 
25 mg 
200 mg 
500 g 



•From Hof&nan-LaRoche Limited, P.O. Box 877, Cambridge, ON. N1R5X9 



Tabie 7. Composition of the mineral premix 



Mineral component 


Amount (%) 


Limestone 


43.3 


Goppo- sulfate (25%) 


6.0 


Ferrous sulfate (30%) 


33.4 


Zinc oxide (72%) 


13.9 


Manganous oxide (56%) 


3.4 



'Mineral premix prepared at Arkell 
^Dicalcium phosphate contained 18.5% caldum and 20.5% of 
phosphate and normally is added at a level of 1.2% to the pig 
grower diet, 1.0% to the fini^er (Uel and IJS% to tiie nursing sow 
diet. 
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Table 8. Statistics on cmbrvo recovery and the introducti on of embryos 
containing the transeene into recipient sows. 



Treatment dumber 

Gilts used for embryo recovery: 

Yorkshire 279 
Yorkshire x Landrace cross 168 
Duroc 12 
Total 459 

Recipient sows * 

Embryos transferred to recipients: 

Embryos microinjected with the transgene 4 147 

Uninjected canier embryos 675 
Total 4543 

Total number of embryo transfers ^ 140 . 

'Sows were used for up to three farrowings of potentially transgenic 

pigs. Sows were inseminated with Yorkshire semen from a high 

breeding value boars. 
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sinele cell zveotes 


usine the Laraa2-APPA transgene 




TD # of 

Li-J ft KJl. 


Rirth Date 


Presence of 


Sex 


Salivary phytase 
(U/mir 


Zygote source^ 


f\i or ^ 

pis 




Trans£tene^Tail/Blood 






1 /CT no 
10 /-U/ 


Anr 1A/0Q 

/vpr iHiyy 


+/+ 


Boar 


6,000 


Yorkshire 


982-02 


Jun 14/99 


+/+ 


Boar 


618 


Yorkshire 


282-04 


Jun 14/99 


+/+ 


Boar 


1,349 


Yorkshire 


405-02 


Aug 14/99 


+/+ 


Gilt 


339 


York/Landrace 


421-02 


Aug 24/99 


-/+ 


Gilt 


0.8 


York/Landrace 


421-04 


Aug24/99 


7+ 


Gilt 


2.2 


York/Landrace 


421-06 


Aug 24/99 


+/+ 


Boar 


97 


York/Landrace 


448-01 


Sq) 03/99 


+/+ 


Gilt 


0 


York/Landrace 


491-01 


Sen 25/99 


+/+ 


Gilt 


2.3 


York/Landrace 


491-02 


Sen 25/99 


+/+ 


Gilt 


0 


York/Landrace 


*tir 1— 


Sen 25/99 


+/+ 


GOt 


0.3 


York/Landrace 


491-05 


Sep 25/99 




Boar 


n 
U 


York/Landrace 


496-05 


Sep 26/99 


+/+ 


Boar 


U 


York/Landrace 


500-03 


Sep 28/99 


t / ■ 
+/+ 


Boar 


i Jo 


York/Landrace 


510-01 


Sep 28/99 


+/+ 


Boar 


ll.Z 


York/ 


559-05 


Nov 01/99 




Boar 




York/Landrace 


560-04 


Nov 02/99 


+*/+ 


Boar 


c 
3 


Yorkshire 


594-03 


Nov 18/99 




VJllt 


'J 1 


Yorkshire 


613-02 


Nov 27/99 




. Cnit 


n 

V.J 


York/Landrace 


613-03 


Nov 27/99 


-/+ 






York/Landrace 


647-01 


Dec 13/99 


-/-r 






York/Landrace 




Dec 13/99 


+*/+ 


Gilt 


16.3 


York/Landrace 


647-04 


Dec 13/99 


-*/+ 


Gilt 


0.5 


York/Landrace 


647-08 


Dec 13/99 


-*/+ 


Boar 


0.4 


York/Landrace 


647-09 


Decl3/99 


+*/+ 


Boar 


1.92 


York/Landrace 


668-01 


Dec 17/99 


+*/+ 


Gilt 


489 


Yorkshire 


671-02 


Dec 19/99 


+*/+ 


Boar 


6.9 


York/Landrace 


671-04 


Dec 19/99 


+*/+ 


Boar 


325 


Yoik/Landrace 


675-03 


Dec 21/99 


-*/+ 


Gilt 


2.1 


York/Landrace 


675-04 


Dec 21/99 


4-*/-f 


Boar 


42.6 


York/Landrace 


675-06 


Dec 21/99 


.♦/+ 


Boar 


5.0 


Yoric/Landrace 



'The number preceeding the dash rqiresents the littra: number and the number following the 
dash is the pig number within the Utter. 

^All PGR assays were conducted with the primer APPA-np2-APPA-Kpn. Assays indicated 
wth a star gave a negative result with flie primer pair. However these samples gave a positive 
result for the primer set APPA-d4-Lama-upl. Samples 613-02 and 613-03 were negative with 
the latter primer set. 

^Saliva was sampled and assayed for phytase 2 to 4 days after birth of the piglets. 

^gotes used for microinjection were collected from superovulated Yorkshire or Yoricshire- 

Landrace cross ^Its. 
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Table 10. Phosphorus content of feces collected from pigs producing a salivary phytase and 
non-transgenic pen-mates*. The data was subjected to a T-test analysis and the data recorded 
below. 





Mean Fecal 
Phosphorus 
(%) 


SE 


Relative reduction 
in fecal 
phosphorus (%) 


t 


t(l%) 


1. 167-02 Grower Diet (122 days): 


1.59 




24.47 






Non-transgenic (n=4) 


2.11 


0.0604669 




8.517 


4.6 


1. 167-02 Finisher Diet (154 
days): 


1.97 




16.97 






Non-transgenic (n=4) 


2.37 


0.0240767 




16.717 


4.6 


3. 282-02 Grower Diet (93 days): 


1.85 




12.90 






Non-transgenic (n=5) 


2.124 


0.022231964 




12.324 


4.03 


4. 282-02 Finisher Diet (145 

days): 


1.76 




16.03 






Non-transgenic (n=5) 


2.096 


0.099153384 




3389 


4.03^ 


5. 282-04 Grower Diet<93 days): 


1.95 




8.19 






Non-transgenic (n=5) 


2.124 


0.022231964 




7.827 


4.03 


6. 282-04 Finisher Diet (145 
days): 


1.56 




25.57 






Non-transgenic (n=5) 


2.096 


0.099153384 




5.406 


4.03 


7. 421-06 Starter II Diet (40 
days): 


1.17 




27.47 






Non-transgenic (n=5) 


1.612 


0.086155741 




5.140 


4.03 


8. 421-06 Start m Diet (48 days): 


1.57 




18.01 






Noa-transgenic (n=5) 


1.915 


0.102884789 




3351 


4.03 


9.421-06 Grower Diet (81 days): 


2.00 




13.28 






Non-transgenic (n=5) 


2310 


0.151658823 




2.022 


4.03 


10. 421-06 Finisher Diet (136 

days): 


1.71 




21.20 






Non-tiansgenic (n=5) 


2.173 


0.053023237 




8.687 


4.03 


11. 405-02 Starter n Diet (40 
days): 


1.81 




26.97 






Non-transgenic (n^=^ 


2.482 


0.173625623 




3.856 


4.03 


12. 405-02 Starter m Diet (48 

days): 


1.54 




36J8 






Non tiansgenic (n=4) 


2.430 


0.104642248 




8.496 


4.6 


13. 405-02 Grower Diet (80 days): 


2.26 




18.19 






Non-tcansgenic (n=4) 


2.763 


0.124724697 




4.029 


4.6 


14. 405-02 Finisher Diet (U6 
days): 


2.26 




13.24 






Non-transgenic (n=4) 


2.605 


0.217198066 




1.588 


4.6 


'Fresh fecal samples were collected on 3 different days was freese-dried and Haea dried to constant 
weight at 1 10°C for 24 h, and ana^zed for total phosphorus. 
*At the 5% level of confidence t«2.57. 
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Table 1 1. Phytase activities of the first generation (G\) transgenic offspring obtained bvflie 
crossing the phytase positive boar 167-02 with non-transgenic Yorkshire pilts' 



VJ tt OX pig 






o ail vary pnyiasc ^u/nuj 


opecinc Acuviiy 
Ur lug pruiciu 


K 1 fit 


Mar lo/uu 


r 


1 1 Q1 

1 lyj 




1 CI AO 

ij 1-02 


u 


r 


/JO 




1 C 1 AC 


C4 


M 




1 AA 

loy 


151-07 




M 


OA1 A 


315 


152-04 




M 


10077 


364 


152-09 




M 


3054 ' 


200 


154-01 


Mar 19/00 


F 


2472 


256 


154-03 


c< 


F 


6425 


706 


154-04 


M 


F 


ILCL 


n,(L 


154-05 


M 


M 


2767 


213 


154-06 


«< 


M 


341 


39 


154-07 


CC 


M 


4029 


142 


154-08 


CI 


M 


1184 


47.4 


159-03 


Mar 20/00 


F 


1563 


116 


159-04 




M 


2285 


201 


"The number of males and females (M/F) in each Utter were 5/3, 7/2, S/4, and 2/3 for litt^ 
numbos 151, 152, 154 and 159, respectively. Saliva was collected from the pi^ets on day 
11. 
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Table 12. Primers used for construction and detection of transgenic constructs. 



Name 


Start-End' 


Forward/ 
Reverse 




Primers used in RlS/APPA+intron and R15/APPA construction 


APPA- 
D0WN2 




R 


TCGGCGCTCACCTTGAGTTC 


APPA- 
DRA 




F 


CXXnTTAAAGCCATCTTAATCCCAT 


APPA- 
SMA 




R 


GTCCCGGGTATGCGTGCTTCATTC 


CAT-ATG 




R 


CCATGGTGGCGGCTriTAGCTTCCrrAGCT 
CCTGA 


CAT-TAA 




F 


AGCGCTTGCAGTTTGTAAGGCAGTTATTG 
GTGCCC 


CAT-UPl 




F 


TCG AGG AGC TTG GCG AG A TT 


R15-UP1 




F 


TTTCXKKKXIAATGTTGCTGT 




Primers used in SV40/APPA+intron construction 


SV-HIND 




F 


OXAAGCTTTACACTTTATGC 


SV-XHO 




R 


GCCCTCGAGCXrrCCTCACTACrrTCT 




Primers used in Lama2/APPA and Lamal/PSP/APPA construction 


APPA- 
CLA 


12635-12657 


F 


GGATCGATAAAAGCCGCCA<XATGAA 


APPA- 
D0WN2 


13307-13326 


R 


TaKKXSCTCACCTTGAGTTC 


APPA- 
DOWN4 


12751-12780 


R 


GCACGCACACCATGACGACTGACAATCAC 
C 


APPA- 
KPN 


13935-13959 


R 


CGGGTACCTTACAAACTGCAAGCGG 


APPA- 
MATURE 


12719-12738 


F 


CAGAGTGAGCCXjGAGCTGAA 


APPA- 
UP2 


13210-13229 


F 


CXjAACTGGAACGGGTGCTTA 


LAMA- 
CIA 


12615-12639 


R 


GCATCGATCTTTGGTTCTGACAAATGG 


LAMA- 
SIGNAL 




R 


TQACTCTGAGTTCCCAATGA 


LAMA-UP 


12111-12130 


F 


GTGCTGCTCCAAGTTTGGTG 






Primers for detection of the Dordne B-«Iobin eene 


PIG-BGF 1 


F 


GCAGATTCCCAAACXnTOGCAOAG 


PIG-BGR 1 


R 


tctgcccaagtcctaaatgtgcgt 



1 The location of the primers shown for Lama2/APPA sequence. 

The start and stop codons of APPA arc indicated in bold letters, the optimal initiation sequence for 
translation is italicized, and the restriction sites for restriction enz/mes are underlined. 
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TBDE EMBODIMENTS OF THE INVENTION IN WHICH AN EXCLUSIVE 
PROPERTY OR PRIVILEGE IS CLAIMED ARE DEFINED AS FOLLOWS: 

1 . A transgenic non-human animal that carries in the genome of its somatic and/or gemi 
5 cells a nucleic acid sequence including a heterologous transgene construct, said construct 
including a trangene aicoding a protein, said transgene being operably linked to a fitst 
regulatory sequence for salivary gland specific expression of said protein. 

2- The animal of claim 1 wherein said first regulatory sequence comprises a saliva 
10 protein promoter/enhancer sequence, wherdsy said animal expresses said protein in its saliva. 

3. The animal of claim 1 wherein said animal is amaimnaL 

4. The animal of claim 3 wherein said animal is chosen &otn the group comprising pigs, 
IS goats, sheq>, cows, horses, rabbits, rodsits, cats and dogs, and in addition, fish and poultry. 

5. The animal of claim 1 wherein said saUva protein promoter/enhancer sequCTce 
conqirises a parotid secrctoiy protdn (PSP) promot^enhancer, a proline-rich protein (PRP) 
promoter/enhancer or a salivary am^ase promoter/enhancer. 

20 

6. The animal of claim 5 wherein said promoto/CThancer is a parotid secretory protein 
(PSP) prpmot^enhancer. 

7. The animal of claim 6 wherdn said parotid seo^eloiy protein (PSP) 
25 promotei/eiihancer is dedved firom a mouse. 

8. The animal of claim 5 wherein said promoter/enhancer is a proline-rich protein (PRP) 
promoter/enhancer. 

30 9. The animal of claim 8 whorein said proline-rich protein (PRP) promotet/enhanco- is 
derived firom a rat 
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10. The ammal of claim 1 wherein said transgene is further opo^ly linked to one or 
more second regulatory sequences including enhancers, transcription regulatory sequences, 
termination sequences, and polyadenylation sites. 

5 11. The animal of claim 1 wherein said transgene comprises a gene encoding a protein 
having phytase activity. 

12. The animal of claim 1 wherein said transgene OTcodes a phytase or a homologue 
thereof. 

10 

13. The animal of claim 1 whCTcin said animal is a pig, said transgene comprising a gene 
OTCoding a protein having phytase activity and wh^in said first regulatory sequence 
comprises a parotid secretory protdn (PSP) promoter/enhancer or a proline-rich protein 
(PRP) promote/enhancer. 

15 

14. The animal of claim 1 wherein said transgene construct comprises a nucleic add 
sequence according to SEQ ID NO:3, SEQ ID NO:5; or SEQ ID NO:7^ 

15. A transgenic non-humaii animal that carries in the graome of its somatic and/or gemx 
20 cells a nucleic add sequence including a heterologous transgene construct, said construct 

including a trangene encoding phytase or a homologue thereof. 

16. The aninud of daim 15 i^erdn said transgene is operably linked to a first regulatory 
sequence for salivary gland specific esq^ression of said phytasa 

25 

17. The animal of claim 16 i^exdn said first regulatory sequoice comprises a parotid 
secretoiy protdn (PSP) promota/enhancer, a proline-rich protdn (PRP) promotei/enhanc^ 
or a salivary amylase promoter/enhancer. 

30 18. The animal of claim 17 wherein said animal is a mammal 
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19. The animal of claim 18 wherein said phytase or a homologue thereof is expressed in 
saliva or in the gastrointestinal tract of said animal. 

20. The animal of claim 15 wherdn said transgene construct comprises a nucleic acid 
5 sequence according to SEQ ID NO:3, SEQ ID N0:5; or SEQ ID NO:7. 

21. A method of expressing a protein, the method comprising the steps of: 

a) introducing a transgene construct into a non-human animal embryo such that a non- 
human transgenic animal that develops jfrom said embryo has a genome that comprises said 

10 transgene construct, wherein said transgene construct comprises: 

i) a transgOTc aicoding said protein, and 

ii) at least one regulatory sequence for gastrointestinal tract specific expression 
of said protein, 

b) transferring said embryo to a foster female; and, 

IS c) developing said embiyo into said transgoiic animal 

whoein said transgene is produced in ttie gastrointestinal tract of said animaL 

22. The method of claim 21 wherein said regulatory sequence provides for salivary gland 
or pancreatic gland specific expression of said protein. 

20 

23. The method of claim 21 who:^ said regulatory seqxioice provides for salivary gland 
spedfic expression of said protdn. 

24. The method of clsdm 23 wherein said salivary gland is a parotid gland, submaxillary 
25 gland, or a submandibular gland. 

25. The method of claim 23 wherein said transgene is expressed in flie saliva of said. 
animaL 

30 26. Ihe method of claim 21 wherein said transgene is heterologous. 
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27. The method of claim 21 wherein said at least one regulatory sequence comprises a 
salivary protein promoter/enhancer sequence. 

28. The method of claim 21 wherein said protdn is a glycoprotein. 

5 

29. A transgenic animal adapted for expressing a protein according to the method of 
claim 21, or a progeny thereof 

30. The method of claim 21 wherein said protein is a phytase or a homologue thereoIT 

10 

3 1. The method of claim 2 1 wherein said transgene construct comprises a nucleic acid 
sequence according to SEQ ID NO:3, SEQ E) N0:5, or SEQ ID N0:7. 

32. A process for producing a protdn comprising the steps of: 

15 a) obtaining saliva containing said protein fiom a non*human transg^c animal, said 

animal containing witfiin its genome a transgene construct, wherdn said transgene construct 
comprises: 

i) a transgene encoding smd protein, and 

ii) at least one regulatory sequence for salivary gland specific expression of 
20 said protdn, and 

extracting said protein fiom said saliva. 

33. The process of claim 32 wherdn said transgene is heterologous. 

25 34. The process ofclaim 32 wherein said at least one regulatory sequencer 
salivary protein promoter/enhancer sequence. 

35. The process of claim 32 wherein said protein is a glycoprotein. 

30 36. The process of claim 32 wherein said transgene construct comprises a nucleic acid 
sequence according to SEQ ID N0:3, SEQ ID N0:5; or SEQ ID N0:7. 
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37. The process of claim 32 wherein said protein is a phytase or a homologue thereof. 

38. The process of claim 32 wherein said salivary gland is a parotid gland, submaxillary, 
or a submandibular gland. 

5 

39. A metiiod for expressing a phytase or a homologue thereof in a non-human aniTn;^!^ 
said method comprising: 

a) constructing a nucleic acid sequence including a transgene c-onstmct comprising: 

i) a transgene encoding said phytase or a homologue thereof, and 
10 ii) at least one regulatory sequence for gastmintesdnal tract specific expression 

of said protein, and 

b) transfecting the animal with said nucleic acid sequence; 

whereby said animal carries within the genome of its somatic and/or germ cells said 
transgene construct and wherein said animal expresses said phytase or a homologue thereof 
IS in its gastrointestinal tract 

40. The method of claim 39 w heie iu said transgene constract results in salivary gland or 
pancreatic gland specific expression of said phytase or a homologue thereo£ 

20 41. The method of claim 40 wherein said regulatory sequence provides for salivary gland 
specific expression of said phytase or a homologue diereof. 

42. The mettiod of claim 41 wherdta said salivary gland is a parotid gland, submaxillary, 
or a submandibular gland. 

25 

43. The metliod of claim 41 whecdn said phytase or a homologue tiiereof is expressed in 

tte saliva of said TnatninaV 

44. The method of claim 41 whorein said transgene construct comprises a nucleic acid 
30 sequence according to SEQ ID N0:3, SEQ ID N0:5; or SEQ ID NO:7. 
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45. The method of claim 39 wherein said nucleic acid sequence is introduced into said 
animal in the form of a transgene constiuct 

46- The method of claim 45 >\1iarcui said transgene construct is a nucleic acid molecule. 

5 

47. The method of claim 46 wherein said plasmid comprises a nucleic acid sequence 
according to SEQ ID N0:3, SEQ ID N0:5, SEQ ID N0:7, or SEQ ID N0:6. 

48. The method of claim 39 wherein said animal is chosen from the group comprising 
10 pigs, goals, sheep, cows, horses, rabbits, rodaits, cats, dogs, fish and poultry, 

49. The method of claim 48 wherein said animal comprises a mouse or a pig. 

50. A nucleic acid molecule comprising a nucleic acid sequoice including a gene 

15 encoding a protein, said gene being op«ably linked to at least one regolatoiy sequence for 
gastrointestinal tract ^edfic escpression of said protein. 

51. The molecule of claim 50 wherein said at least one regulatoiy sequence comprises a 
salivary protein promotei/enhancer sequaice, whereby cacpression of said protein is salivary 

20 gland specific. 

52. The molecule of claim 5 1 "herein said salivary protdn promoter/CThancer sequence 
comprises a parotid secretory protein (PSP) promota/enhanccr, a proline-rich protein (PRP) 
promota:/enhancer, a salivary amylase piomotei/enhancer, or a SV40 promotei/enhancer. 



25 



30 



53. The molecule of claim 51 vdierdn said protem con:^)iises aphytase or a homologue 
thereof. 

54. The molecule of claim 53 wherein said molecule is a transgene constiuct 

55. The molecule of claim 54 wherein said molecule is a nucleic acid molecule. 
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56. The molecule of claim 55 whCTein said molecule comprises a nucleic acid seqtraice 
according to SEQ ID NO:l, SEQ ID N0:2, SEQ ID N0:4, or SEQ ID NO:6. 

57. The molecule of claim 53 wherein said molecule includes a nucleic acid sequence 
5 according to SEQ ID N0:3, SEQ ED NO:5; or SEQ ID N0:7. 

58. An antibody specific to a protein expressed by a nucleic acid sequence according to 
SEQ ID N0:3, SEQ ID N0:5; or SEQ ID N0:7. 

10 59. The antibody of claim 5 8 wherein said antibody is monoclonal. 

60. The antibody of claim 58 wharein said antibody is polyclonal. 

61 . A hybridoma secreting the antibody of claim 59. 

15 

62. A host cell transfected with molecule of claim 50. 



63. A host cell transfected widi the molecule of claim 56. 



20 64. A host cell transfected with the molecule of claim 57. 



65. The host cell of claim 63 wherein said cell is an bacterial cell 



66. The host cell of claim 64 wherdn ssid cell is an animal cell 

25 

67. A diagnostic kit for immunologically detecting a protein expressed by a nucleic add 
sequence according to SEQ ID N0:3, SEQ ID N0:5; or SEQ ID NO:7, the kit including an 
antibody specific to said protein. 

30 68. The kit of claim 67 wh^ein said antibody is monoclonal. 

69. The kit of claim 68 wherein said antibody is polyclonal. 
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Figure 1 (continued^ 
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13 kbps '"tron 12.4 kbps 



Figure 1 fcontinued't 
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pKS+ Adapter Lama2 

Kpnr Rsdl 
-G GTACI IGGGAGGTCG | GTCCG- 
--C^C^fGACa:TCC^GCCAG1 GC- 



Apaf 




PSP Amp 

pKS/Lama (Apal-Kpni) 

6 kbps 



Kpnl 



Figure 3 
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3a (PCR1) 
Lama-up Apal Lama-CIa 



pKS/Lama2 



PSP 



MCS 



3b (PGR 2) 
Lama-up Apal Lama-signal 

MCS 



PSP 



Appa-Cfa 



R15/APPA 



Appa<Kpn Appa-mature 



ADoa-Kon 



R15PRP appA SV40 intron R15 PRP appA SV40 intron 



4a 



4b 



Apal aal Clal 



Kpnl Apal 



Kpnl 



LAMApcri 
176 bps 



APPApcri 
1315 bps 



LAMAPCR2 
229 bps 



APPAPCR2 

1237 bps 



Apal 




PSP Amp 
pKS/Lama (Apal*Kpnl) 
6 kbps 



Apal 




Apal 



Figure 3 (continued^ 
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Figure 5, The nucleic acid sequence of the Lama2/APPA plasmid (SEP ID NO: H 



LOCUS 

DEFIKITION 
ACCESSION 
KEYWORDS 



REFERENCE 

AUTHORS 

JOURNAL 

FEATURES 

DEFINITION 
ACCESSION 
VERSION 
SOURCE 
ORGANISM 



SYN 



17-aAN-2000 



Lama-appA 20623 bp DNA CIRCUIiAR 
Lama 2/APPA transgenic construct 
Lama 2-appA, 

parotid secretory protein; acid glucose -1 -phosphatase; 
gene ; 

periplasraic phosphoanhydride phosphohydrolase; artificial 

sequence ; 

cloning vector 

1 (bases 1 to 20^23) 

GolovajQ., S., Forsberg, C.W., Phillips, J. 
Unpublished. 



appA 



M, musculus Psp gene for parotid secretory protein. 
X6B699 

X68659.1 61:53609 
house mouse. 
Mus musculus 



Mammalia; 
Mus. 

and Hjorth^J.P. 



Eukaryota; Metazoa; Chordata; Crania ta; Vertebra ta; 
Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; 
REFERENCE 1 (bases 3777 to 5332;) 
AUTHORS Svendsen,P., Laursen^J., Krogh-Pedersen,H. 

TITLE Novel salivary gland specific binding elements located in the PSP 

proximal enhancer core 
JOURNAL Nucleic Acids Res. 26 (11) , 2761-2770 (1998) 
KEDLINE 98256451 

REFERENCE 2 (bases 7147 to 12653; 13952 to 17731) 
AUTHORS Mikkelsen,T,R. 
TITLE Direct Siibmission 

JOURNAL Submitted (07-OCr-1992) T-R. Mikkelsen, Department of Molecular 
Biology, University of Aarhus, CP Mollers Alle 130, 8000 
Aarhus, DENMARK 

REFERENCE 3 (bases 7147 to 12653? 13952 to 17731) 

AUTHORS Laursen J, Hjorth JP 

TlTIiE A cassette for high-level expression in the mouse salivary glands. 



JOURNAL 
MEDLINE 



Gene 1997 Oct 
9370303 



l;19B(l-2) :367-72 



FEATURES Location /Oualifiers 

source l.to 12653; 13952 to 17731 

/organisms "Mus musculus** 
/strains "CSH/As** 
/dbjxref =«taxan: 10090« 
/chromosomes ** 2 " 

/naps"Eatiiaate: 69 cM from centromere*' 
/clan&s«Lanibda TPl, Lanibda yP3, Lambda YP7*' 
/cXoDeJLib»"Lanbda-PHAGE (Lambda L47.1)« 
/germline 
/notee^Allele; b" 
miscJEeature 3777-5332 
/gene^'PSP" 

/functions •'salivary gland specific positive acting 

regulatory region" 
enhancer 7147 . . 6724 

/ evidencessexperimental 
exon 1177 8.. 11824 

/gene««Psp" 

/note="exon a* 

/numberai 

/.evidenceaBexpecimental 
exon 12626.. 14190 

/gene«*»Psp'' 

/notes "exon b fused with exons h and !*■ 
misc feature 12644-12652 
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Figure S (continued): 

/function^" consensus sequence for initiation in higher 
eukaryotes * 
misc__feature 13952-13965 

/functions" M13mplB polylinker" 

DEFINITION E. coli periplastnic phosphoanhydride phosphohydrolase (appA) gene, 

ACCESSION M5B708 L03370 I<03371 io3372 li03373 L03374 L03375 
VERSION M58708 .1 GI : 145283 

SODRCE Escherichia coli DNA. 

ORGANISM Escherichia coli 

Bacteria; Proteobacteria; gamma siibdi vision; Enterobacteriaceae; 

Escherichia . 



REFEREhJCE 1 (bases 12653 13951) 



AUTHORS 
TITLE 



JOURNAL 
HEDLINE 



Dassa,J., Marck,C. and Boquet,P.L, 

The' complete nucleotide sequence of the Escherichia coli gene appA 
reveals significant homology between pH 2.5 acid phosphatase 
and glucose- 1 -phosphatase 

J. Bacteriol. 172 (9), 5497-S500 (1990) 

90368616 



FEATURES 

Source 



sig_peptide 

/gene^^appA" 

CDS12653 



Location/Qualifiers ^ 

12653. -13951 
/organisinB**Escherichia coli" 
/db_xref = " taxon ; S€2 ■ 
12653.. 12718 

13951 

/gene»"appA" 

/standard_name»*acid phosphatase/phytase" 
/trausl_table=ll 

/product=''periplasiaic phosphoanhydride phosphohydrolase" 
/protein__id="AAA72086 .1" 
/db xref««GI:14528S" 



/ translat ion= "MKAILIPFLSLLIPLTPQSAFAQSEPBLKLESWIVSRHGVRAP 

TKATQLM0DVTPDAWPTWFVKLGWI,TPRGGELl7k^^ 

GQVAIIJiDVDBRTRKTGEAFAAGLAPIXMTVHTOAOT 

NVTDAILSRAGGS I ADFTCTRQTAFRELERVLNFPQSNLCLKRKKQDESCSLTOALPS 
ELKVSADNVSLTGAVSLASMLTEIFI-LQQAQGMPBPGWGRITDSHQWNTLLSLHNAQF 
YLLORTPEVARSRATPLI^LIKTALTPHPPQKQAYGVTLPTSVLFIACmiy^^ 
ALELimi»PGQPDNTPP0(3SLVFERHRRLSmSQWIQVSLVFQTIi^^ 
PPGEVia#TLAGCEERNAQGMCSIiAGFTQrVNEARIPACSL*' 
inat_peptide 12719 13948 

/genees "appA" 

/products "periplastic phosphoanhydride phosphohydrolase*' 

mutation r^lace (12659. . 12661r"gc9 changed to gee") 
/genc«"appA* 

/standardjnaiae«''A3 mutant" 

/notes "czeaced by site directed mutagenesis" 

/citations (31 

/phenotype«"sllent mutation" 
mutation replace (13934 . .13936, " cog changed to ccc") 
/gene="appA" 

/standard_name=" P428 mutant" 

/notea" created by site directed mutagenesis" 

/citation= (3) 

/phcnotype=" silent mutation " 
mutation r^lace (13937. .13939," gcg changed to get") 
/gene=*appA" 

/Btandard_name«" A429 mutant" 

/notes "created by site directed mutagenesis" 
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Figare 5 (continued): 



DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 

REFERENCE 
AUTHORS 
TITIiE 
JOURNAL 

REFERENCE 

AUTHORS 

TITLE 

JOURNAL 

MEDLINE 

REFERENCE 

AUTHORS 

TITLE 

JOURNAL 

MEDLINE 

FEATURES 

Source 



CDS 



/citat:ion=(3] 

/phenotype=« silent mutation • 

pBluescript II KS(+) vector DMA, 
XS2327 

X52327.1 GI:S8061 

artificial sequence; cloning vector; egression vector; vector, 
synthetic construct, 
synthetic construct 
artificial sequence. 

1 (bases 17732 to 20€23) 
Thomas, E. A. 

Direct Submission 

Siibmitted (20*FEB-19dO) Thomas E.A. , Stratagene Cloning 
Systems,. 1109S North Tomey Pines RxJ. , La JoUa, CA 92037 r-USA 

2 (bases 17732 to 20623) 

Short, J. M., Fernandez, J. M. , Sorge,J.A. and Huse,W.D. 

Lambda ZAP: a bacteriophage lambda expression vector with in 

vivo excision properties 

Nucleic Acids Res. IG (15), 7583-7600 (1988) 
88319944 

3 (bases 17732 to 20623) 
Alting-Mee3,M.A. and Short, J.M. 
pBluescript II: gene mapping vectors 
Nucleic Acids Res. 17 (22), 9494 (1989) 
90067967 

Location/Qualifiers 
17732 to 20623 

/organisms "synthetic construct" 

/db_xrefe"taxon: 32630** 

cooplement (1B967. .19827} 
/genes "An^** 
/ products "b- lactama se ** 



BASE COUNT S449 a 4847 C 4902 g S424 t 

ORIGIN 

X TCGAGAQTAT CTTTGrCAGC TGTGOCTCCA ACAAAQGGGT ACT6TT6CCC ACATAGAAAG 
61 ATCTA AACTA ATTAATTAAX CCCTCACCCG CAAATCTTTC AGTCACXAAG TTAGCACGAT 
121 TGTTGAACAA GTTCTCCAAA GGAGAGATAC AGATCaOTGC GTATAGGGTG GACCTGGCTG 
181 CTGAGGAGAC AOCTGCATCT GACTAA6AAG AGGCA0G6TG TTAGTTGAAT GGTGTGGAGT 
241 AGGGTGGTTC TGT6GGACAG TAGAAAATCG AGAGGCATGT GCCGTTTAGT GAACTGATGG 
301 AAGCIACCCC AAACGaCAGA GATTGTCAGT CAGGCCAATC OGrTTTCGAGT TTGATGGGCA 
361 GCC3QGACAGT GAGACRQACA CACCTACTCA GTTGGAGGAA GGATGAGAAC AATGGCCAGC 
421 AGGGATTGAG AGACCCTGAC AGGCGCAAGG CCCTAACACA CACACCTACC ACCTCACTTG 
481 ACAAAGCTGC CAAAGAOCAA AGACTTGTTC TCCATTAGAA ATGACAGCTG GCTTGACCCG 
541 ACAGCATAAT AAGCAGAGTG TACTCTGATT GGAGAACTTT AATGTGTTTC ATTCAGTATT 
601 ATAAAAGGAC AGTATEACAG ATTTTGTTGT ACACTGCT6T TACATGTGGG GCAGTGTGTC 
661 TTTAAGTAGG GTAAAGTACT CTTTAAAAAT OGGTCCTAGA TATTTTTTCC TTTAACTCAA 
721 GTCTCTTACT GTTTAAATOA TTTTTATTTT 6TTTAAIATG GAGGAAAAA6 AAGCGTAAAT 
781 OGACAAIATA TATTTAGAGA AAGOIGGTTA GCTGTCAGAA AAATflTGCAA ATCAAAATCA 
841 CACCAAGACT GCAGCACACC CCTGTCAGAT GGCIGTGATC AAGAAAATAA AIGACAATGA 
901 GTGGTGGTGA AGATGTACTA AAGGGAAACA CACACACACA CACACACACA CACACACACA 
961 CACACTGGAG CAACCACTGT GGAAATCAGT ATGAATGGTC CTCAAAAACC TGAAGATAGA 
1021 GOGGGGOGTG GTGGCATACA CTTTTATTCC CA6CACTGGG GAGGCAGAGG CAGGTGGATC 
1081 TCTGAGTTCC AGGCCAGCCT GGTCTATA6C ACAGGTTCTA GGACAGCCAG G6CTACACAG 
1141 AAAAACCCTG CCTTGATTAA ACCAAACCAA ACXAAAOCAA AOCAAAOCAA ACCAAACCAA 
1201 ACCAAACCAA ACCAAACCAG ACCAAACCAA AACACTQAAO ATA/SAACTTC AGCATTOCAT 
1261 TCCTAGATAT ATACCCAAIG GAGACTAA(3T CAGCAAGACA CClbcACAGC CA' I G IT CACT 
1321 ACTACACTGT TCAOCACA6C CAGGCTGTQG AAOCAGCCTG AGT6TCCATG ATAAATGAAT 
1381 GGATAGGTAA CTTTCAAGGT AAATQGACTC T<jCTGTGTAC ATGCCTCACA TTCTGTTTAT 
1441 TCATTTTTCT TTATGAGGTG TCCATTCAGG AGTCACATGG TAGTTCTATT TTCAGTCTTC 
1501 TGAAG ATAC T ACACTGQTCC CCACAGTTTA CACTTTTATC AGCAGTGAAT AAGGGTTCCT 
1561 OTATCCTTAC CATCATTTGT TGTAATTTTT CTTGATGACC CTCTTTCTGA CAGQGATAGG 
1621 ATGTA ATATC AGTGTGAGGA AGTACAACTT GTTTTCTAAG TATTTATTGG CCCCTTGCAT 
1681 TTCTTCTTTT GAAAACTGTC GGTTCCTGAC ATCTGCTCAG GXATTCATTG GATGTTGTTT 
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Figure 5 (cx)ntiDuedV 

1741 CTTTGGTGTT TGAGTTCTTA TGAATTCTAG ATGTTAAATC CCTGCCTGTG GTTCTCTCCC 
1801 ATTCTGTAGG CTGCCTCCTC ACCCTGGCAA TTGTTGTCCT TGTTTTGCAG AAACTTTTGA 
1861 CTTCATGGAA TCTCATTTGT CAGTTTTCCC TCCTCTGCTA TAGCCTX3AGC TAATGCACTG 
1921 GTTTTTACAG AGCCCTGGTC TATGCCTTTA TCCTCCTCTG GCAGCTTCGG AGTTTCATTT 
1981 CTTACATTTA GATCTTTGAT CCACTTTGAA CAASTTTTC3G AGCAGGGTCA GAGATACGAA 
2041 TCTAGTTCCA TTCTTCCATA TGTGATCCTA GTTTACATAG CATOGrTOGT TGAAGAGGTT 
2101 TTATTTTATT TTTA AATAAT GTGTCATAAA AAACGAGGTG GTTGTAGCM TGTGGATTTG 
2161 TTTCnTGTC CTTTGATCTA CAGGTCTTGT TTTGrTGTCAG TCTCATGATX3 TTTTATTOCt' 
2221 ATGGCTCTGT CATACAGTCT GAGGTCAGGT ATTGTGATAT ACCTTCAGTA TTGCTCCXTTC 
2281 AGACTCAGGT rrGCTXTGGC CAGGAGTCAT CTTACTCAGT GCTCTTAGAG CTCCCCCRGC 
2341 ATGTAGCTGC TACTATTCTT ABTTGATAAA TCAGGAAACT GGGGCTCAGA GAGATTAACT 
2401 6TCTTGAACT ACTTCTGGGG AGGTGAAACG TGGAGACACT AAACTGTCTT TACCXrTGTAC 
2461 TGCTCCAGTA GCTGTOGGGT GCTGGOCTAC AGCAAAGCAC CTATACTATA TATTACTCAG 
2S21 GAGGTGGAAA AACTCAGCCT CCCrTGGGGT TCCCAAGCTC CCAGGTGTCC AGTCACTQCT 
2S81 GGAAACCTCA TGGAGTCTGA ARGGAAGGGT TGAGGGTACA TGGGGCAGCG ATGAGGAGCC 
2641 TGGGGCTGGG ATCTCCCAAA CACXTTGGATA TCCAGATGCC ACTGGGTCAG GGGGAGTTGG 
2701 GAA CAGAG TT GGGATGTCCA TGGACCTGTG ACAAGGCCAG GGCCAGGGGG AGGATAACTC 
2761 TGGCTTTACT AATTTGCGAA AGTCCTTAGC TTAGCAGCAG TTGTCTGGGA GCACAGAGGG 
2821 GCCTTCTGTA AGAGGCTCAG GCAGT6CCGC TCTGTAGGOG AAGGTCTTCT CCATGTTCCC 
2881 CATGGTGGTT CTTGATGAAA GAGACAGTCC TTGGCTCX3UV ACTGGTTTAr TGATTGTTCA 
2941 TTGTGGAAAA TGGGTGCACA CCACCTTCTC AGGGTGGACC AGAGATCAAA TACCTTTTGC 
3001 AGGGAGGAAT ATCTGGGAAG GGACGCTTAC TGGCTAAACC CTCAGGGCCT CTAGATACAT 
3061 CATTAGCATG GAGAACTCTG TTCTGOGCTA CATGACCACA GGCCAGATTT CCACAAGCCA 
3121 CATGTGGGAA GTGTGGCACA TGTTCTAGGC CAGGAATCTG GTAGGGAGCG TGGAGCCACC 
3181 TACCATCCCA GGTGGGTGCC TGGGTGCCAG GGACXXTOAA CCCGCTCAAC CTTACCAAGT 
3241 TTCCTGGCAG GGTCCACTGT CCTACACAGA AGCTGGAGGA GGTGTGAGGG TTGTGTCTTT 
3301 GTGGAA3X5TC CGATGCTGCT TGGGGCTCAG TTTCTCCACC TGTACCTCAT TGGTTTGGGT 
3361 ATAAAAAGTG GGGAIACTTT ATTATTCTCT GACTCXSGTCC TGAGGAAAAA 6CATOGIGGC 
3421 AGTCCAGGAA CCACACCCTG AGGTTCCTGC ACTGAAGGGA CTCOCTAAGT CTCTCGAGTC 
3481 TCTCCCCTTC ACAGAGCTGC CAAAGTCTAG GTTCTTT1X5A GQATAACAGA GCCATGCTTG 
3541 GTAAGCAGAC AACAGCATTT GTTTACTCAA CCTiVrmU TCAGCTOCCT CTTCATAAAC 
3601 AAGTTGAGAC ACCA3X5CTGG CTTGAGGAAG ACTTCTAAAG CCAGACAACT GTGC3WW3GAA 
3661 GAAGAAGAAG GGGCAAGTGG AGTTAGCCTG GATGTROCCC TCAAAGTCTC CA6AGACCAG 
3721 CCATGAAGGC TCAAGTGGAG GGCAAGACCT GCAGCAGCCA ASCATCTOGC AGGAGAQGAT 
3781 CCTGGGAACC CCTCTACCAT GACACACATT CTTCCTGCaG GTCACACTTA ATAGGCCATT 
3841 TCTTATTTGG ATCTATCATG GTGTTCTGTG CQAGATTAAT GAGGTGTTAT GCTGCGAACA 
3901 GAAAGTTATA TAAAAACRAG TCCC0CCCX3C TTGTCACTGC T6CIAAGAAT GTAGCAGAAA 
3961 TTGTCTCAAG TGTCTCTCXA AXCAGAAACA AXAAAGGTCT CCTTGGATTC AAGCCCTCCA 
4021 GTTTCCrcCT TCCTTGCXGA GCCTTOGACA CCCATACAAA CCIOCTGGAT GCTAlCAOCrC 
4081 TGGGCAGAGA CTOCAAGGTG GGGAGAGACT GATGGTACAA AAGCAAAATA CTTGmGGG 
4141 GGTACACCCA CTOCTCTGCC TGTGI TO I TU CIGCAGTCAO TOCTGCAGAC AGGCCCTCAG 
4201 TG GGTCT TCX: ATGGGCAACA OGCAGAGGOA GGCAATQQAT GOOAATACCC ACACCCTGGT 
4261 TAGTr EACOC 0GGCCA3GCT CTCTGCTCTT CATOCXTTCCT CTGCCCTCTG CCACGGCTTT 
4321 CTCTGCAGGA ATCATATCTT CAXAXTGGCC CACAGGTGTT CTCCTCUCCC TAGCTAIGAT 
4381 GTTTACTTTA GAGTGAOCTT AGCAGGGCTG GTGGGAATGA GTTCTAGAAG GCTCAOGGAG 
4441 AIGCXAGGGA AGAAACGTCT TCTAACTACT 6AGGTTACXA AGTTCCTGGT GGTTGTCTCT 
4501 GCCTTTOCCT TGTTAAAGTC ACCTTGAAGT TAGTGCAGAA GAAATCAGAG CCCAGTCACA 
*561 GAGTAAATA T GGTOCTGAAG ATTTCCrrrG AGTGCOCAGA ATCCATCACA TTTCAAGAGC 
4C21 OCTCTTTGTA CCTTAAGTCA TTTGGGGTTG XATCTTCTGC TrGATGXATG TGTGTGTGTT 
4681 TATCAAAGAG TGAGATGGTT ACATAAGAGG TGCTCTAAAG GACAQAGAGG ATTTGCAATT 
4741 GTGGCATGTG ACATCCTCAG GCCTTGCTCT GGTGOCAGGA GGAACTGATG CAGAAAAGAG 
4801 TAAGAGGTCA TTTCCTGGAG GCTOTCACTA TAGAGGAGAT CTTACAGTGC ATTCCCTCXrr 
4861 CX3U3GCCCTG CCTGAGGATA GACAI6TGCT GACTGCAACT GAAACAGAGG CTTGGGATGG 
4921 AGAGTTAGGT TCACAGAAGG GAGGGTGGGA GATCGATGCT TGCTQGGTTC TGGGTCTCAT 
4981 CACXIAGCTCC TGACCACXXX3 GTCAGCCXAT GIQCTTATTC CATAGCTTTC TTTTOCTATO 
5041 TTTACTCAGT GTGGTGTTTG TTGGQACC3CA OCAGAAGCCA GTCCXAGGCT GACAGCrOTG 
5101 GATACACAGG GCAGCATGAG GGTGCTCA3C CTGAAGGAGT CAGGCIX»3CA OAAGAGAAAG 
S161 ACCAGCACAC ATTCCTTCAA CCMCIhWr CTTGAAAAAC AAACATATTA TATCACAaCAT 
5221 ATTGCATTTA TCAGACAQCT AAAAICTACT OGQQXAGCAT OACTOCAGGT GGGGATATCT 
5281 GCAAGTGCrA TGAGTGGCAG AGGGACAGCC AATGT6AGGC AAGAAGGAAT TCTGGCTCAA 
5341 CACAGCrXAG CTOCXTrOOIG TTOGTTGAAA CTTTGAGAGT GCACTTTATT 
5401 TTTGACATAT ITAAACAGAG CACAACTTTG GGAAAAAGTT TTCTTATGAA AATTATCACA 
5461 ATAAAGCTTA AOGCATGACT ACATTAAAAT GCCnTGCAA AGTATATGTG CCCTCTTCCA 
552 1 CAAGAATGOT TCTATTGACT 6AGAAATAAT GTTCAGGATA AAGATCCAGG AAGAAAAGAT 
5581 CAGGGATAAG TAAAATACXA AACTCTTTTG CAAAGTACAT AGACCCTCTT TCATAACAAT 
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Figgre 5 f continued): 

5641 GGGTTCTATT GACTGACAAG CACTGCTCAG GAGTTGGGAA AGAGTCTAGC ATAAGCACGA 
5701 TAGCCTGGAG ACTCTAGTGA GGTCTAGTCT TACAGACAGC AAAAATCACC AGGTTACAAA 
5761 CTACATTCAT TTCCAGTTTT CTGATCAGGC ACAGGTATGA ATCCCTTCTG TTGAAGAGAA 
5821 AAGTCCATGT GTTTAAAATA TCTGGTTTCT CCAGTGCTAT TAGCGAGAAG ACTTGAGCCC 
5881 TATACAACTC CCACCTGGAG TGACATCCTG TCTTCATGGT ATATTACATA CCTAGACACG 
5941 CTCATCTCAC AGACTTAGGA CTTTGTCTTC TGATCTCCAT TTCTGATCCC ACTTCX:ACCT 
€001 TTGCXTTTGAT AGTGTCATTT TCTTCACTGC CTT6GTGACA AtXATGTTAT CCTCTGTGTA 
6061 TrroAGTGTT ACCATTTTCA GATTTTACXTT GTATGCAAGA TCACACAGTC TTTGTCTTTC 
6121 TGTCTCGATG CATGCTAATC TCTACACAAC AACCXTTTCCC CGTCACTCRG ATCTTCCTCC 
6181 ATTAACACAT ACATGGTGCT GAAGAGGCTA GGGAGCTTCC CTTCAGTGGG GAGCTAGCTG 
6241 GCTATTGGGC CTTTTTGACT GTCCAGGAAG GCCCCCAATT GCTGAGACAA GAACTTAGAT 
6301 TCTTCATTAT TGACTCTAAC TCATGTATCA AGCAGAAGCT AATGAATAGT TATCAACAGG 
6361 ATCAGAGGTT CCAGTGTAAG ACACTTTGAC ATGAAAGAAC GGAGGAAGGA CAGATGGATG 
6421 CATAAAAGCA GGACCACTGC CCCAGGAACG TCCTGGAAAC TGATGCAGGG CAAAGOACAG 
6481 GTTATAAACC AAATCTTAGG GAGTCAGGAA GAGCACAGAG GTUSCTCAACC AACTGACCAC 
6541 TGCTTAGGGG CTACCAACCC AATCXJTCCCT GTGGGAACAG CTAAGCTATC AGCCAAGGGT 
6601 AATAAACAGG CAGGACCTGT GGA3X3ACATG GAGAGCATAG GGACCCTGGG TCCAGCCTTT 
6661 AGCACCTGCA CTCTCAGGAT ACTCCACCAT TGTGTCTTAG AGAGCCTAGG GATACTGGGT 
6721 CCAGCCTTTG GTACCTTCAC TCTCAGGGTA CCXXATCACT GTGTCTTGGA GAGCCTAGGC 
6781 ACCCTGGGTC CAGCCTTCAG TACCTGCGCT CTCAGGACAC CCCACCA!rTG TCTCTTGCXX: 
6841 CGTCTCTTCT TCCTCTTCCT CCCTTTCATT GTCTCTTCTC TGTTTCTTTC TTOACTCTCC 
6901 TTTCCCCTCA CACCCTCACT CtAGTTCTCC CCTTCCCTCT CTGCATCACC CTATTCTCTC 
6961 TGTGGTCCCT CCACTTTCCT TTATCTCTCA TCCTTCTCTC CTCCCTCAAA TACTTGTCAC 
7021 CCACTATACT TCAGGGGCCA GCTCXAGTGA CAAAGCTGTT AATAGCAAGA CTCTCAGATC 
7081 TCCRAOSGCT CAGAGGAGCC AGACCCACCA AGAACTCTCT CCAGGTCCAA TTTCAGGXrC 
7141 CTTOGAAAGC TTTCAGCAAA TGCXCAGG6A ACATGCCACT AACAAGAAGA TGCAAATTCC 
7201 AGTTGAGAGT GGGAAAGQCC CTTGOGTAGG TCCXrATCTTC CAGGCCAAGG TCAGAGGGGC 
7261 TCTGTGTAAT COGGATTGAC AGGGCTCAGA ACAATGTTTT GTTTTTAAGG TTTATTTATT 
7321 TrAGGTGrEA GrGTCTTTGC TTGGATGACC TCATGTSCM CATGTGTGTG CAGGTTCCTG 
7381 ATGACASTAG AGGAGGGCTT TGAAT0CCT6 GOGATAGGAA 6TTACAGGAA ATXATAAGCT 
7441 GCTTTGTGGG TCTTCIAGCT CTC0CARCA6 AAGTGAATGC TCITCACXSU: TGAGCCATCT 
7501 CTCTAGGCCC AAGftGACATT GCTTTATOGA TAIRATTGTO TGTOTGTGTC AACATTGAGG 
7S61 AAAGGGAAAT AAAAAAAAAA CTTCRGCCGC TAAGGTTCrTA CAGTTTCRCT AATTOCTACT 
7621 TTTAGTTGTG ATAAAATGGC AGGTGCrTCA ACATTTATAT ATACAAAAAC TTCCCTOCTG 
7681 GtGGTTCAAC TGTGAGAACT GGGGTAASTG GGTGAGTTCT CTTTTTCTC5T CTCTGTCTCT 
7741 GTCTCTCTCC TTCCATTCTT TCTTAAAGGA AATAAACATT GCAGCTGGGT TATAGCTCAT 
7801 CSUn^TGGAA GTTACAGAAG TGAAAAAAG6 CATT6CCTTG GTGGGTGGTG T TAOC AGCTG 
7861 ATTTTTGGrT GTCCTGCAAG GAGGT CTG GG GACIGGCTGC TCTGTCTCTG TCIGTATGAG 
7921 TGAGGGAAGT CTGQGGAGCA GATTCCCTAA CCTTCAGCCr GGCCTGGTTC CTQAGIGAAC 
7981 CCAGCCTCTC TGGTCCTAGT AGCTTTTTCC AAACAGGAAT CTGAGTGGIG ACA06GAACA 
8041 AGTACCAGCC CATTGCTTAA GTGCCAGGGT TAGTGAGG6C AGGAA6CTGC GA3AGCTGGG 
8101 ATTAGTAGTT GTATTGGATG TAGGAAGTCC TAaXXTOGOA CAGCTAATCC TTAAXGCTTC 
8161 ACTGGAGATT TTCAATGAGA AA!ITTATCXX: AOGGCCCATA TGGCXCCATC CTTTTGTCTC 
8221 CAACAGCCAA GTArrTTCCA TTAGAGGAGA CTTCXnGTAC ACTXGATGGA TGCTGATTCC 
8281 AAGGTGACTT GGGGCAGTCA GTACAGACTT GGGATGAOCT CTGACAGCCT AACCTCTCCC 
8341 CAACAAOGGC CCTCTATGTT TGCTATGXAA TGTAATGTCA GACATTGTCA GGAGTGTCCG 
8401 CAGCACAGCC TGCXTAGTCT GAGGGCTCTC ATAGGTTTCC CACXX3TCTTA TCXACACAGG 
8461 GATAAOGAGG AGGTAW3CTG CAGTTCCC A G TCTCACTTCA CAGAGGAAGA GAtAACCCCA 
8521 TCCCAGGTCA TGTAGCCAGC AGTGGAAAGA ATGAGGATTT GAACTCAGGT CTTCCAAGTC 
8581 CCATTGATAG CATCTCCTCA CAAGTOCCTT GCCACCCTCA CGATGCCTTA GACACTTGCC 
8641 TGCCCTTTAT ACTAAGGAGA TGCAGGTACA AGGGGTTTAC CCATGTAGCA GCTGAGGCAG 
8701 CTQGGGATAG ATACXaWGCAG CAGGOCTGAT GTCACXACTC TAACTCCAGC ATCCCCAGTC 
8761 TG'itSrr C ClG GAGTGTGAAA ATCCCTACTT AACAAGATTG TGCAACAGTC CTTGGCTCTG 
8821 TGAC0CATA6 CTGGAAACAG GAirCTCATT GATTT G T GG A ACATGGTGGC AGCCAGCCaA 
8881 AAAGAGG6TC T6CATACAGA AGACA08TGT GGCAAGGCCA CAGCAGACTC TGACTACXfTT 
8941 AGCTTACAGA ATTACAAGGT CATAATOTCC TCTGCTTTSG TCACCTCATG TTAAGOACAG 
9001 GCCCXAATQA AGAXGGQGCA GAAGACTGAA GGAATGGCXA AC2CAATAACT GGCCCAACTt 
9061 GAGACCX3VTC CTACAGGCAA aCMXSJOTC CTGACACTAC TAAIGAXACT CTOTXATGCT 
9121 TGCAGACAGA AGCCTAGCAT AACTATOCTC 06AGAG6TCC ACOCAGCAAC TQACTGAAAC 
9181 AGAAAAAGAT ATCCACAGGC AAACAQTGGA TGGAGGTCAG GGACTATTAT GGGAGAGCTG 
9241 TGGGAAGGAT TAAAAACCCT GAAGGGGATA GGAACXXX:AC AGGAAGACCA ACAGAGTCAA 
9301 CTAAGAGACC TGTGGGAGCT CTCAGAGACT GAGCXACXAA CCAAAGAGCA TACACAGGCC 
9361 GGTCCX3AGGC ACCTGGCACG TGTGAAGCAQ ACATGCAGCT CAGTCTCCAT GTAGGTCCTC 
9421 CAATAAGCGG TAGCCTGACT GCAGTATCCA ATCCCXIAACA GGGCT6CATA GTCTGGCCTC 
9481 AGTCGGGGAG GATGCCOCXA ATCXTTGCAGA GACTTGATOA GTGGAGAGCT ATCCAGGGGG 
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figure 5 (continued): 

9541 AACCCACCCT CTCTGAGAAG 
9501 GGACAAACAA GAACCTCAAA 
9651 GCTCTGTCCT GTTCCTCAGC 
9721 TTTCATGGGC CTAGGACTTG 
9781 TACAAGACTA GGGACATTCC 
9841 CCCAATCACT GGATTTCTAC 
9901 ATTTCTTCCA CTGAAAAATG 
99G1 AAAGAGGTGT ATCTAACCTG 
10021 TAGGAAGGAC AAGAACTGCT 
lOOBl TCTCCTGCTG GTGCACAGGC 

10141 cxrrcxrrGTCA ggtcactggg 

10201 TAAAAAACAG TGGTCATG6C 
102C1 GAGCXGTGGG CATGAACCAC 
10321 G6TGAGCTCA 6TAAAT6TGA 
103B1 GTTCCCAATA TCCTGM5GAG 
10441 CCTGAGAGTT ACCAGTTTCT 
10501 TCTTGTTC3GT GTGTGAGTGC 
10561 AACTATCAGG CSGTTGTCCCC 
10621 TTATTCCAGG GGAGCAAGTC 
10681 GACAATGGAG CCTCCATCAG 
10741 CTCXftCCCAC ATAGGAGTTG 
10801 TCAGTTACTT TATCTGGTAA 
10861 CATTTGGGAA CCTGGATCTA 
10921 CAGATTATAA TTCCCCCCTA 
10981 TCXXrrrTGGA CCACAGACAT 
11041 TGGTAGTATG TGGCTCACAG 
11101 TGTGAATCAT TCCCCXIAGTC 
11161 TCaACCTACT TTTCCCCTCC 
11221 TAATAGCCTT GGGCAGCATT 
11281 TTGAGTCAGT GAAAGAGTGA 
11341 CAGCAGAGGT GGTTGAAGCT 
11401 CACTGAGAAC CACAATTGAA 
11461 CAAAAAATGA CTGGCTAGCT 
11521 ATTGATTCAT TCTGGATCTT 
11581 CAAGTTTGAC AACATAGGGC 
11641 CCTGGGTGAC CTTTGGGTTG 
11701 TTTAGAACAT CCAGGAAATA 
11761 TAGAATOCCA GGAGATC6TG 
11821 AGCAGIXSAGT A6TGACTGCA 
11881 GGGOUUSAAG CAGAGGGAA6 
11941 GAAGACATAG AZGACATTGrT 
12001 AGGGATCAGA TTTXATTCAT 
12061 6AGIGTAG6C TCAATAA3GA 
12121 AAGmOGTG GAGAAAGGCA 
12181 GCTGGCTTCA TAGAAGGTGT 
12241 AAAGTGAOCA GGC3VA0GCA6 
. 12301 TTATAGAXAC ACACACATTT 
12361 TCTCrtAACT GCXAAGCACA 
12421 TCATTTTTCA TTGTGGCTGA 
12481 CTSAGGGCAG GCATATCCCC 
12541 TCrrGTTGAA AGGCAAGCGT 
12601 CCTGCTATGA CTCTCCATTT 
12661 CATCTTAATC CCATTTTTAT 
12721 GAGTGAGCCG GAGCTGAAGC 
12781 TCCAACCAAG GCCACGCAAC 
12841 GGTAAAACTG GGTTGGCTGA 
12901 CCAACGCCAG C3GTCTGGTAG 
12961 TCAGGTCGOG ATTATTGCTG 
13021 OGCCX3G6CTO GCACXTTGACT 
13081 OGATCCXSTTA TTTAATOCTC 
13141 TGA060GATC CTCAGCAGOG 
13201 GGOGTTTOGC GAACTGGAAC 
13261 TQAGAAACAG GACGAAAGCT 
13321 CGCCX5ACAAT GTCTCATTAA 
13381 7CTCCTGCAA CAAGCACAG6 
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GGAATGGGGA TGGGGGAGGG ACTCTGTGAA 6AGGGGACAA 
TAGGTCAGGC CCTAAAGGCT TGCTAAGTAG CAGTGGCCCA 
CCAAGGCTCA GCTCCCACCT GTTTCTGTGT TTTTCTGGCT 
GTGACCAGTT CAAACAATGG GGCCTGTGGA AGACACAATA 
TGTTCTGCTG ACTATCX:aTA GCCTGATGTA GGTGGAAGGA 
CCTTGCACAA CCTTGACAGC TCAGGGCCTC TCAGAAACCT 
AGACTCTCAA ATGAACGTCG TGACAATCAT CAGGCTTATT 
AATGGCAAGC AGACAGCAGG CAAATGTCTG TATCAACCTC 
CACTGCTGCC CCCCAGGAGG CGATPTGCrG AAACAGCTGC 
CXTTGCCTTCT CATTGCAGCC ACAGCCCCTT CCTGTCTGAA 
AAACAGATCA AGATGGAACA GGACAGCTCC TGATGGTAAA 
TATTCATAGG GGTTTATGCT TCTTCACSTCC ACACTGTGAA 
AGTGTTOGAG GTAGAGTTGG GGTTCTGAAA TTCACAGTGG 
GCTGOAGGTC ACTCXTTGAGA CACACAGTCC TGCTGCTTCT 
AC3GACACATC TA CTIWITC AGAGGCCACA GTCTAGTTGA 
TATTTGIGTC TGTGTGTGTG TGTGTGTGTG TGTGTGTGTG 
AGGTGCACAT ATX3ATAGCGT ACACXttTTGAG GTCSiGAGGAT 
TCCTACTTTT CCTOGGACTC TCGAGAACAA ACATGGGTCC 
GCTGTTGGCT GACACATCTT GCTCACMAC ATTTTACCTA 
AGTATTACTT TAGCTCCTCA CCGATGGCAA TOCACCACCT 
GGTCTCCACA CACCCCCACA CCCCCTTCAC CAAAACGTTT 
AGTTCATCAG AGAATGAAGC CAGTATTAAG AACATGGAAT 
GCAATACCCC ACCCTAGATO GAGTTGCTGA GTTTTCACCT 
GCTTCTATGG TTTATTCTGA AACCAGGGGA ACTOGATTCC 
CCTGGCTTGT GAATTCACAT GTCATCTACT GCTAATCCAT 
AGACACACTA CAGTCATGGC CaATGTCAAG GTAGGACAGA 
CTGCXXSTTTT CATGACXAAC CCTCCTCAGC ACAGTGACCA 
TTTTATTTTT AGAATTGCTG GAATTTTCTA TTTTGAGAAA 
AAACAAAATC ATCTAGAAAG CTGGTTTAAA AIACAGATGG 
GGAATOTCAT TATTCGCOCC TCACAGAGGC TGGCTCACXC 
CTTGGACACG GGTCAGGTGC ATAGGAAAGG TOGTCTGGGA 
CAAACAGAAC TGTTGGCTTT TTTTTTTTTA AATGAGTTCT 
TAGGCAAATA CTTCGAGCCA ACCCAACAGA ACATTCTTCC 
CTTTCTAGAC AATACTGAAC TGACCCCTTG TTGGCAGTCT 
TTTGAACTTG GCACAAGGTC CATCACTGTC ACCXAAGCAT 
GAATATCTTG GCTAACCTTA GATATTTTCT TTGGAGTATC 
GGGCTTGATT CTCATCCTGO GACCACAATA TAAGTCACCC 
CAGAGAAACA AGGATCTCTC TCGTGTGCAT OCTTCITCAA 
CXAAACTGA6 TTCOCATCT6 AGAGTCC3^ OGAGGCTTTG 
GCACTCTTTG T6TTG6XAAA GTTTTGACTC TAACAAATTT 
GTCAGACtAA CAACAACCTA GACTCAT6TG GGTTCTXITTT 
GAATGACTTG TCTKAGrGTA TAGAGAAAOG CTTOCTACTG 
CAGAAGAGAT AGCTATTTCC OCXAGGGACT GTGCIGCTCC 
GTGGOGAAOC TAQAT6TGCT CTCTGGGGAG GGGGTCIGAA 
GAAGmXGC TGAAACATCT AAACA6AATT ATA6CTTA6G 
OGAftlGICTr GCATATGtAT ATGTACATGA ATAXATXATG 
GAAOCIGAXT TOCAGATGAC AGAAAAIAGG TTATTrrGCC 
A3X3ACTTOCA GTTOCATC3CA TTTCCTGAAA TGCCACAATT 
ATAAAATTCC ATTGCAGACT GGGCCCTACT TCATCCACTC 
TOGCrcCATT TCTXACCtAT TGTGAAGAGA AGTGCAACTG 
GAGAGAGGCa GGCACXAATT GTGGGTTTTT giTr c ri ' dT 
GTCAGAACCA AAGATOGATA AAAGCOGCCA CCATGAAAGC 
CTCTTCTGAT TCCX3TTAACC CCGCAATCTG CATTOGCTCA 
TGGAAAGTGT GGTGATTGTC AGTOSTCATG GTGTGC6TGC 
TGATGCAGGA TOTCACOXA GACGCATGGC CAACCTGg6c 
CACCX5CGCGG TGGTGAGCTA ATCGCCTATC TCGGACATTA: 
COGAOGGATT GCTGGOGAAA AAGGGCTGCC OSCAGTCTGG 
ATGTCQACGA GCGTACCCGT AAAACAGGCG AAGCCTTCGC 
GTGCAATAAC OGXACATACC CAGGCAGATA OGTCXAiSTCC 
TAAAAACTGG OGTTTGCCAA CTGGATAACG CGAACGTGAC 
CAGGAGGGTC AATTGCTGAC TTTAC0G6GC ATOGGCAAAC 
OGGTQCTTAA TTTTCOOCAA TGAAACTTGT GOCTTAAAOS 
GTTCATTAAC GCAGGCATTA CCATCXSGAAC TCAAGGT6AG 
COGGTGCGGT AAGCCXCGCA TCAATGCTGA CGGAGATATT 
GAATGCCGGA GCCGGGGTGO GGAAGGATCA CCGATTCACA 
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Figure 5 (continued!: 

13441 CX»GTGGAAC ACXTTGCTAA GTTTGCATAA CGCGCAATTT TATTTGCTAC AACGCACX3CC 
13501 AGAGGTTGCC CGCAGCCGCG CCACCCCGTT ATTAGATTTG ATCAAGACAG CGTTGACGCC 
13561 CCATCCACCG CAAAAACAGG CGTATGGTGT GACATTACCC ACTTCAGTGC TGTTTATCGC 
13621 CGGACACGAT ACTAATCTGG CAAATCTCGG CGGCGCACTG GAGCTCAACT GGACGCTTCC 
13 661 CGGTCAGCCG GATAACACGC CGCCAGGTGG TGAACTGGTG TTTGAAOSCT GGCGTOGGCT 
13741 AAGCGATAAC AGCCAGTGGA TTGAGGTTTC GCTGGTCTTC CAGACTTTAC AGCAGATGCG 
13 801 TGATAAAACG CCGCTGTCAT TAAATACGCC GCCCGGAGAG GTGAAACTGA CCCTGGCAGG 
13861 ATGTGAAGAG CX3AAATGCGC AGGGCATGTG TTCGTTGGCA GGTTTTACGC AAATCX5TGAA' 
13S21 TGAAGCACGC ATACCCGCTT GCAGTTTGTA AGGTACCCGG GGATCACAAC TTGCCCTCTG 
13981 AAGAGGAAGA ACAGAAGGAT GCCACAACTC TCCTGCTGGC TACTCTCCAG TGGTTTCATC 
14041 TTACTTCTCA TGGCATTTCC CTCTAGAAAG TGCTACTATC ATCCACACAT TTCTACCTGA 
14101 GACCACCCAA AGGACCCTCC CAAATTCTCT TCXTTCTCTGA GTAGTCTCCA CACCTGTTAC 
14161 CACTATCCCA GAATTAAAAT CXTTAACTGCA CTCTGGCGTG TGACTTGCCT CAGTCCTTGC 
14221 AATAAGAGTT GTTGGCAGTG CXaOGCXSTGG TGGCGCACGC CTTTAATTCC AGCACTTGGG 
14281 AGGCAGAGGC AGGCGGATTT CTGAGTTCGA GGCCAGCCTC GTCTACAGAG TGAGTTCCAG 
14341 GACAGCCAGG GCTATACAGA GAAACCCTGT GTCXSAAAAAC CAAAAAAAAA AAAAAAAGTT 
14401 GTTGGCAGAG TGTGGGTTAT ATACCAGGTG GAGATTTCAA ATGAGTGGCT GAAGCTGTAG 
14461 CCAGAAGGAA CTTAGAGGAT AGCTCATAAC TTAAAAAGAA ATGTAGAGAG TAGCAGAAAC 
14521 ATTQAGftGAG TGGGCACACA GCCACTGIGT GAATGTGGCA 6AACACAATC OWGCCAGCTA 
14581 TACATGCATA AQTGTAXATT GGOGOCATCC TGACTGATGA GACACAGGAA AACAGATAGA 
14641 OGGGGTTAGG TGGCCATGGC CrTTCCTGCC TGCCTCITCC TAAGGGTCAT CfCAAGACCT 
14701 TATGCTCXCT TAACTCTTCC ATTGCTACTT AGCTTCTAGA TATCACCTCC AGATTAGTCT 
14761 CCTTGGGTAC ATCAGTGATC CTGGTGAIAT CCAGGGCTTC CTGATTCC31T CTTTGTCATA 
14821 GAGGCTGCAA CTAAAGAGGT CTTCrrAATA CTTCACACCC TGATGCCAAA AGGAAGACAC 
14881 AGAAGTTCAC AGAGGTGAAG TGATTCATGT AGGACATACA GTGAGCAAGC ATCAGGGTCC 
14941 GGATTATCTG ACTCTACTCT AACTTTTATG TAAATGTGCT TTATGCCATT AACACTGTCA 
15001 TTCCTGTGCT TCAGCTCTCG GAGACTCCCA AGCACTCTTA GGCACAAGCC ACAATTAAGG 
15061 GACTCTGACA CTCTGCATXG ATTAATTAGC ATGGTGGTCT CTATGTTTCC AGATTCATCA 
15121 TTGTTTCACT TTCCATAXAG GCTAIGAAGG GTGTGAGGAA ATTTTTTGGG GACAGAATTG 
15181 GAGGCAATCC ACCTCTCTCA GGAAGCCTCT ATCTGGAAAA GCTTACAACT CAGGGACAGT 
15241 AACTGTAGGC CCAGTCCTTG GTGTCCAAAA TGGGTTTTAT GGTTTGAATC TGCAAAGOCT 
15301 IXXATGTGCT CAAAGGTTTG AACATGGAGC CTCCTCCTGG TAACACTGTA TTGGAGGCTT 
15361 TTGAGACTGG ATGCTCTTTG GTCCCMVrr TTGCTACATC ATCTGTCAAG ATATGACCCA 
15421 GGCATGCTAC CAGCTACCAC AGACTATGCC TCTCCAGCTT TCATGTTCTC CCC3VCCATGA 
15481 TAGACTTGTA TCTCCTAAAA ATGGAATCAA AGCAAACTTT TCCTCGATTA AGTTTTTTTT 
15541 TTTCTGTTAA GTGTTTGGTC ACAGGGACAA GAAAACACTC AATACAGATA ATTAGIACCA 
15601 GAGTTGAGGT TCATTGCTCT AGCAAGTTGG ATCAAATTTT TAGGGCTTTG GAACIGATTT 
15661 ATAAGAGACA TGTAGAAGAG TCTGAAGCTG TGGGCTACAG AAGTGICRCC A CXTIITA AG 
15721 AATA6TTTAA TACACCAIGG GAATTGTGAA AATCAGAAX6 CTCACACAAA GGCAGACAGG 
15781 AAAACGTGAG CATGrGGOST 6TGAGAO0GC ATAAGAAGGA AGCTAGGGGO AAAXGAGCTA 
15841 GAAGGCATTC GGCTACGTTA GGGAAOGTGT GTOGCTGrGC TTGGOCCAIG GCCXGGCAAT 
15901 CTGAATGAGG CCAAATTTTA AAGGAGIGGA CTAACXOGAT TGTCAGAGAA AATA3CAAGA 
1S961 CaGACXaCCA CXCAGGCTAT GCOGnXSTTTG TGACOGACXrA GCTACTCTTA GOCftGCTCTA 
16021 TTGXGAAATT CCAGAGCAAT TATCAGAGCA T6AAGAIACA TACAGTTTAG tCGAAOTAAOG 
16081 GGTGTGGGTC CCTAAGXGGA TGGTGCAXAA ATCXATGIAG GTGATGCCTA AGIQACACTT 
16141 GAXAATCXAA AATATCA6CA ATGTGGAATG TCTTCCAAGG AGACCTGTAG ACACACATTT 
16201 TAGAACITTG CTCAIXSGCIG TAAXAMXA6 CIAGCtAGAA ATCATTTOCT 6AAGAGGTTA 
16261 GICIGAGTTA OGGTTOCAGG GCAAACKTTC AGIGA3G6CA AGGAAGGCAT TGCAGTCAGG 
16321 AG0CAAAG6X CAGCIGGTYTA CATXGCATCA AGAGZAGAGA GTCAGAGTGT GAGTAGAAAO 
16381 AGGATACAGG TTAIAAAACC TCACTGTOCA CTCTCAGCAA TOCArTTTCT CCTAAAAGGC 
16441 TTTACCTTCT AAAGATTTTA GTCTTCAAAA CCAGTACCAG TAGCXTTCGGA ACAAAAGTTG 
16501 AAACAAATGA GCXrTTTGTGG GGCATTTCAC ACTTAAAACA GGGCATC310C TAGGAGGAGC 
16561 CXrrGTGTGCA GTAGGAAGTG TGGCCTCTGT GTCAGQAATG CTCAGGCTAA TAAGGGGTCC 
16621 TCTATCTGAG GGACCCTATG AAGATTCAAC AAGtAGTTGT GAGAATTCCX: TGTAAAIGGA 
16681 TGCTACCAAT TTGACATTTG TAGACCTGCT ATTGIGTGCT TCTTTATTGa GCTCTCC5d!AT 
16741 CTCCCAACTT TCCAACCCAT ATTCCACATT AATCOCTTCC ACCACCATGC AACACTAGGT 
16801 AGGAGAGAAG GAAGGTTAGA AGAGAAAGTG GGTAXAGATC TATTTAGACT ACTTCCTGCT 
16861 GATTAGGGGC AAGT<XAATC GTCATTGTCA GGATACCTCC AACCAGCAAC CAGCAAACCA 
16921 GCAAATCAQA AACAGCAAAA GCAOCCAACA AGGCAGCACT AACCAGCAGG ATTGGGGTC6 
16981 GTAGCGTGGG AGCAGTCACT ACTGGTCTTC TCATGOCTTT GGCATTAATA CTCTCTCAAG 
17041 AAATTCX36TA AITTTTTCCC CAC XaCCT GA AATTaXSTAA XTTTAAATGC AAACTATCTA 
X7101 CAGCTGGCAA AAAICACATC TCTOCTAGAG CACAAGACAA ATCATA6TTA CTGGCXATTT 
17161 GGAATCTGAA GCATCTCAAT AIGCCACACC TGGGATTAAA AGAAAAACAT ATTCACATCA 
17221 CATAACTGTT rmrm i JC AATTTTTTAT TAGGTATTTT CTTTATTIAC ATTTCAAAaxS 
17281 CTATCCCGAA AGTCCCXriAT ACCCTCCXIAC CTCCCTGCTC CCCTACACAC CCACTCCXaC 
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Fieare 5 rcontinuedV- 

17341 TTTTTGACCC TGGAGTTCCC CGGTACTGGG GCATATAAAG TTTGCAAGAC CAAGGGGCCT 
17401 CTCTTCCCAG TGATGGCCGA CTAAGCCATC TTCTGCTACA TATGCAGATA GAGACACGAG 
17461 CTCTGGGGGT ACTAGTTAGT TCATATTGTT GTTCCACCTA TAGGGTCGCA GACCCXTTCA 
17521 GCTCCTTGGG TACTTTGTCT AGCTCCTCCA CTGGGGGCTC TGTGTTTTAT CTAATAGATG 
17581 ACTGTGAGCA TCCACTTCTG TATTTGACAG GCACTGGCCT AGCXyTCACAT GAGCCAGCTA 
17641 TATCAGGGTC CTTTCAGCAA AACCTTGCTG GCATGTGCAA TAGTGTCTCC GTTTGGTGGT 
17701 TGATT ATGGG ATGGATCCAC TAGTTCTAGA GCGGCCGCCA CCGCGGTGGA GCTCCAGCTT 
17761 TTGTTCCrTT TAGTGAGGGT TAATTGCGCG CTTGGCGTAA TCATGGTCAT AGCTGTTTCC 
17821 TGTGTGAAAT TGfTTATCCGC TCACAATTCC ACACAACATA CSGAGCCX^GAA GCATAAAGTC 
17881 TA AAGC CTGG GGTGCCTAAT GAGTGAGCTA ACTCACATTA ATTGCGTTCC GCTCACTCCC 
17941 CX3CTTTCCAG TCGGGAAACX: TGTCGTGOCA GCTGCATTAA TGAATCGGCC AACGCGCGGG 
18001 GAGAGGCGGT TTGCX5TATTG GGOGCTCTTC OSCTTCCTCG CTCACTGACT CGCTGCGCTC 
18061 GGTCGTTOSG CTGGGGCGAG CGGTATCAGC TCACTCAAAG GCX3GTAATAC GGTTATCCAC 
18121 AOAATCAGGG GATAACGCAG GAAAGAACAT GTGAGCAAAA GGCCAGCAAA AGGCC3VGGAA 
18181 CCGTAAAAAG GCCGCGTTGC TGGCGTTTTT CCATAGGCTC CGCCCCCCTG ACGAGCATCA 
18241 CAAAAATCX3A CGCTCAAGTC AGAGGTGGOG AAACCCGACA GGACTATAAA GATACCAGGC 
18301 GTTTCCCCCT GGAAGCTCCC TCGTGCX3CTC TCCTGTTCCG ACCCTGCCGC TTACCGGATA 
18361 CCTGTCOGCC TTTCTCCCTT CXX^GAAGCOT GGOGCTTTCT CATAGCTCAC GCTGTAGGTA 
1B421 TCTCAGTTCG GTGTAGGTCG TTC6CTCCAA GCTGGGCTGT GTGCACX3AAC CXXXXX3TTCA 
184 81 GCCCGACCGC TGCXXXTTTAT COGGTAACTA TCX3TCTTGAG TCCAACCCGG TAJ^GACACGA 
1BS41 CTTATCGCCA CTGGCAGCAG CCACTGGTAA CAGGATTAGC AGAGC3GAGGT ATGTAGGOGG 
18601 TGCTACAGAG TTCTTGAAGT GGTGGCCTAA CTAOGGCTAC ACTAGAAGGA CAGTATTTGG 
18661 TATCTGOGCT CTGCXX3AAGC CAGTTACCTT OSGAAAAAGA GTTGGTAGCT CTTGATCCGG 
18721 CAAACAAACC ACCGCTGGTA GOGGTGGTTT TTTTGTTTGC AAGCAGCAfiA TTACG06CAG 
18781 AAAAAAAGGA TCTCAAGAAG ATCCTTTGAT CTTTTCTACG GQGTCTCACG CTCAGTGGAA 
18841 CGAAAACTCA OGTTAAGGGA TTTTGGTCAT GAGATTATCA AAAAGGATCT TCACCTAGAT 
18901 CCrrrTAAAT TAAAAATGAA GTTTTAAATC AATCTAAAGT AXATATGAGT AAACTTGGtTC 
18961 TGACAGTTAC CAA3GCTTAA TCAGTGAGGC ACCXATCTCA GGGATCK3TC TATTTOSTTC 
19021 ATCCMAGTT GCCOXSACTCC COSTCGTGTA GAXAACIAlOG AIACGGGAGG GCTTAOCATC 
19081 TGGCCCCAGT GCTGCAATGA TACOGOGAGA 0CX3V0GCTCA CCGGCTCCftG ATTTATCAGC 
19141 AATAAACCAG COWGCOQGAA GGGCX3GAGCG CAGAAGTGGT CCTGCAACTT TATCCGCCTC 
19201 CATCCA GTCT ATTA A TTCTT OOOOGGAAGC TAGAGTAAGT USTTCCCCMS TTAATAGXTT 
19261 GCGCAAOSTT GTTGCCATTG CTACRGGCAT OGTOGTGTCA OGCTCGTOGT TTGGTATGQC 
19321 TTCATTCAGC TCOGGTTCCC AAOGATCAAG GOGAGTTACA TGATC(XX3CA TOTTGTGCAA 
19381 AAAAOCGGTT AGCTCCTTC3G GTCCTCXXSAT CX3TTGTCAGA AGTAAGTTGG CCGCAGTGTT 
19441 ATCACTCA TG GTTATGGCAG CACTGCATAA TTCTCTTACT GTCATGCCAT CCX3TAAGATG 
19501 CTTTTCTGXG ACTGGTGAGT ACTCAACXAA GTCATTCT6A GAAXAGTGIA TOOXXXsACC 
19561 GAGTTGCTCT TGCCCX3G0GT CAATAOSGGA TAATACC3GCX5 0CAC3lXajSC3l GAACmAAA 
19621 AGTGCTCATC ATTGGAAAAC GrXCXTOGGG GCGAAAACTC TCAAGGATCT TACX3GCTGTT 
19681 GAGATC XAGT TCX5ATGTAAC CCACTCGTGC ACCCAACTGA TCTTCAGCAT CTTTTACTTT 
19741 CAOCAGOGTT TCTGGGnX3AG CAAAAACAGG AAGGCAAAAT GOCGCAAAAA AGGGAATAAG 
19801 GGOGA CaCG G AAAT6TTGAA TACTCATACT CTTCCTTTTT CAATATTATT GAAGCATTTA 
19861 TCAQGGTXM TGTCTCATGA GCGGATACAT ATTTGAATGT ATTTAGAAAA ATAAACAAAT 
19921 AGQGGTTCCG OGCACATTTC COCGAAAAGT GCCACXTAAA TTGTAAGOGT TAATATTTTG 
19981 TCAAAATTCG OGTtAAATTT TTGTTAAATC AGCTCATTTT TTAACCAATA GGCCGAAATC 
20041 GGCAAAATCC CTTAXAAATC AAAAGAATAG AOOGAGATAG GGTTGAGTGT TGTTCCAGTT 
20101 TGGAACAAGA GTCCACTATT AAAGAAOGTG GACTCCAAlOG TCAAAGGGCG AAAAACOGTC 
20161 TATCAGGGCG ATGGCCCACT ACGTGAACCA TCAOCCXAAT CAAGTTTTTT GGGGTC3GAGG 
20221 TGCCGTAAAG CACXAAATCG GAACCCTAAA GGGAGCCCCC GATTTAGAGC TTGACGGGGA 
20281 AAGCCGG06A AOyrGGCJGAG AAAGGAAGGG AAGAAAGOGA AAGGAGCGGG CGCTAGGGOG 
20341 CTGGCAAGTG TAGCGGTCAC GCTGOGCGTA ACCACCACAC CCGCCXSCSGCT TAATGCGCX:G 
20401 CTACAGGGCG CGTCCCATTC GCCATTCAGG CTGCGCAACT GTTGGGAAGG GCGATOQGTG 
20461 CX;GGCCTCTT CX3CTATTACG CCAGCTGGCG AAAGGGGGAT GTGCTGCAAO GCXSATTAAGT 
20521 TGGGTAACGC CAGGGTTTTC CCAGTCACX5A CGTTGTAAAA CGAOGGCCAG T6AG0G0G06 
20581 TAATACGACT CACTAIAGQG CGAATIGGGT ACCGGGCCCC CCC 
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Figure 16 
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Fieure 18: Nucleic acid sequence of the known segment of the RlS/appa+intron plasmid 
including the vector sequences of pBLCAT3 (SEP ID N0:2), 



DNA SYN 15-APR-2000 

with vector cut 13543 to 4954 



LOCUS R15/appa+intron 6708 bp 

DEFINITION RlS/appa-i-intron transgene 
ACCESSION RlS/appa+intron 
REFERENCE 1 (bases 1 to 6708)) 
SOURCE synthetic construct. 

ORGANISM synthetic construct 
artificial sequence, 
salivary proline -rich protein, acid glucose-1 -phosphatase; appA 
gene; periplasmic phosphoairLhydride phosphohydrolase; artificial 
sequence; 

Golovan, S., Forsberg, c.w, , Phillips, J. 
Unpublished. 



KEYWORDS 



AUTHORS 
JOURNAL 



DEFINITION 
ACCESSION 
VERSION 
SOURCE 

ORGANISM 

Mammalia; 

Rattus . 

REFERENCE 
AUTHOI^ 
TITLE 

encoding 

JOURNAL 
MEDLINE 
FEATURES 

soturce 



Rat salivaxry proline-rich protein (RP15) gene. 

M64793 M36414 

M64793.1 GI 5206711 

Rat (Sprague-Dawley) liver DNA. 

Rattus norvegicus 

EuJcaryota; Metazoa; Chordata; Craniata; Vertebra ta; 

Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; 

1 (bases 1 to 1748) 
Lin,H.H. and Ann,D.K. 

Molecular characterization of rat multigene family 

proline-rich proteins 
Genomics 10, 102-113 (1991) 
91257817 

Location/ Qualifiers 
1..1748 

/organisms* Rattus norvegicus** 
/ 8 trains " Sprague -Dawley" 
/db_xref « " taxon : 1 0 116 • 
/ tissue_type= "liver" 
/tissue^libs^cosraid genomic library" 
1802-1810 

/functions" consensus sequence for initiation in 
higher eukaryotes ^ 



misc feature 



FEATURES Location/(^ali f iers 

DEFINITION £. coli periplasmic phosphoanhydride phosphohydrolase (appA) 
gene, 

ACCESSION M58708 L03370 L03371 1*03372 L03373 L03374 L0337S 

VERSION M58708.1 GI: 145283 

SOURCE Escherichia coli DNA. 

ORGANISM Escherichia coli 

Bacteria; Proteobacter la ; gamma subdivision; 
Enterobacter iaceae ; 

Escherichia. 

REFERENC:E ^ l (bases 1811.. 3109) 

AUTHORS Dassa^J. « Marck^C. and Boquet , P . L . 
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Fi^re 18 rcontinued): 

TITLE The complete nucleotide sequence of the Escherichia coli 
gene appA reveals significant homology between pH 2.5 
acid phosphatase and glucose-l~phosphatase 

JOURNAL J. Bacteriol. 172 {9), 5497-5500 (1990) 

MEDLINE 90368616 



FEATURES 

Source 



sigjpeptide 
/genes" appA" 
CDS 



Location/Qualifiers 

1811. .3109 
/organism^" Escherichia coli" 
/db_xref = " taxon : 5 62 " 
1811.. 1876 

1811.. 3103 
/gene="appA" 

/standard_name="acid phosphatase/phytase" 
/ 1 r ans l_tab 1 e= 1 1 

/products "periplasmic phosphoanhydride 
phosphohydrolase ** 
/protein_id="AAA72086 . 1" 
/db_xref=:"GI: 145285- 

/ translat ion= " MKAILIPFLSLLIPLTPQSAFAQSEPELKLESWI VSRHGVRAP 

TIOITQLMQDVTPDAWPTWPVKLGWLTPRGGELIAYLGHYQRQRLVADGU 

GQVAI IADVDERTRKTGEAFAAGLAPDCMTVBTQADTSSPDPI*FNPLK^^ 

imT)AILSRAGGSIADFTGHRQTAFRKLERVLNFPQSNLCLKREKQDE 

ELKVSADNVSLTGAVSLASMLTEIFLLQQAQGMPBPGWGRITDSHQWNTLLSLHNAQF 

YLLQRTPEVARSRATPLLDLIKTALTPHPPQKQAYGVTa^PTSVLFIAGHDTNL^^ 

MiELNVmiPGQPDNTPPGGELVFERWRRLSDNSQWIQVSLVFCr^^ 

PPGEVKLTIAGCBERNAQQ4CSLAGFTQIVNEARIPACSL " 

matjpeptide 1877 3106 

/genes "appA" 

/product="periplasmic phosphoanhydride 
phosphohydrolase 

mutation replace (1817 . . 1819, "gcg changed to gcc") 
/geness^appA" 

/standard_naine=''A3 mutant " 

/note-"created by site directed mutagenesis" 
/phenotype«"silent mutation" 
mutation replace(3092- .3094,* cog changed to ccc") 
/gene=*appA" 

/standard_name«" P428 mutant" 
/note=" created by site directed mutagenesis" 
/phenotype=? silent mutation " 
mutation replace (3095. .3097, " gcg changed to get") 
/gene="appA" 

/standard_narae=" A429 mutant" 

/note«" created by site directed mutagenesis" 

/phenotypes" silent mutation " 
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Figure 18 (continued): 



DEFINITION Plasmid pBLCAT3 (bases 3109 to 6708) 
ACCESSION X644D9 

X64409.i GI:58163 
synthetic construct, 
synthetic construct 
artificial sequence. 

1 (bases 3109 to 6708) 
Luckow,B.H.R. 
Direct Subiaission 

Submitted (06-PEB-1992) B.H.R. Luckow, German Cancer Res 
Center, Im Neuenheimer Feld 280, W-6900 Heidelberg, FRG 

2 (bases 3109 to 6708) 

LuciCGW, B . aiid Schut z , G . — 
CAT constructions with multiple unique restriction sites 



VERSION 
SOURCE 

ORGAinSM 

REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

REFERENCE 

AUTHORS 
TITLE 



for 



regulatory 

JOURNAL 
MEDLINE 
COMMENT 
experiments 



FEATURES 

source 



the functional analysis of eukaryotic promoters and 
elements 

micleic Acids Res. 15 (13), 5490 (1987) 
87260024 

Promoterless CAT vector for transient trans feet ion 

with eukaryotic cells. Allows the analysis of foreign 
promoters and enhancers. 

Location/Qualifiers 
3109 to 6116 

/organisms "synthetic construct" 
/db xref="taxon:32630" 



3197. •3810 

/notes ••SV40 signals" 
3807, .4047 

/note=''SV40 signals" 
coo^lement (5244 . . 6104 } 
/ codon^s tar t =^ 1 
/ trans l_table 1 
/gene="An5>" 

/product a^bet a - lactamas e " 
/protein_id="CAA457S3 . 1« 
/db_xref x= «GI : 5 8 165 « 

BASE COUNT 1916 a 1479 c ISIS g 1798 t 

ORIGIN 

1 GGATCCCCTT TGCTATGTAG TTTTXAATG6 AAATTACAAC. CCATAGTGTG TTGATAAATA 
61 GAGAGTCCTG TTTGGTTTAA GCAACCTCTG TTTCTC».TAA ACTCCATAAA AACAGGAATA 
121 CTCTTTGTTT CTAGCATAAC CAAAAGATTT AGTGAATTGA AAACAATGTT CCCTTAGAGT 
181 ATAGGTCTAA TAACCCCGAA AATATTACCA TGATACTGAG CATTTGTAAG TATCTCATA6 
241 CATGTAGTAT CCATAGTCCA TCAATGAGAG AGACATTTAA CATGATTTTC ATTAATCAGG 
301 TGGAAAAGAC ATGACAACAT TCACAGCSCAC TGCACAGAAC ATAGTGGTCC ACCTTGCACA 
361 TATTTC^ICTA AACTAGGTTT ATCTATTTTG TTGCTTTCTC TAACATCTCT (KaATGAAGC 
421 AGGTCAACAG TGCCACATAT CCTTTACTTA ACCTAAGGAA CACAAAAAAT TTTCTACATA 
481 TATCCTGGTT AGAGAGTGCT TAAAATAAGT TTTCCAAGAA TGGAAAAGAA ATGTTCTGAC 
541 TTAACAATTA AGACAGTATT TATTTAAAGC AAGAAATATG AGGCACACAA GAAAATATTT 
601 TGGGAAGAAA CCATTTGGTG AACAATATTT CAAATAAAAA TAGACAAACA TAGTTAATTG 
661 TAAAACATAT GTTTGACCAG CCCTTCTTTT CAATAGGCTT AATGTGAATA AAATGTTAAA 
721 GATTCTCTTT GGGTGGCTGC AAATTGTCCA CGAATAAGAC AAAATATAAA AATAAGGACT 
781 GAGTCTCACA AAATGAAAAG (SAAATATATT CAGAAAGAGA ATCTTGAGAG AATGTGTTGT 
841 CACAAATTAAltaAAAACCTG TGGTGAATGA CATCCTGAGG CCTGAGCTAT TACTGACATT 
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Figgre 18 fcontinuedV 

901 TAAGATAAAG GTAACTGTAT ACATTTGTCC CATTGAGGGG ACAAGAAAGC TGCTCTCATG 
961 TTCAGCTCTA TAATTCTTGC CTTAAACAAC TTAAATAGAA TGATTTAAAA TATGGAGCTG 
1021 TCCATGGACC TTTGAAATAT AAAATAGTCA AGCAACTTAT CAAGGAATTA CAGATTCCTT 
1081 GATACTAACA CAGGTAAATC CCACACGTGT TTTGAGACTA CATTTGCTGG GATTTTATTG 
1141 ATGTAATAGG TCACATGTTT TTCGGGCCAA TGTTGCTGTT ATTCGGTTAC TTCAAGAGAA 
1201 TAGTGGCAAC TGATGCTATG TATTCTAGGG GTTTGAAGTG ATGTTTCATG ATTGAAATTT 
1261 GTAAAAGAAT AACATCATCA TTCTTAACAA TAGAACATAT AAAGTCACAC AGAAGTGACA 
1321 GTGTTTAAGC TGTACTATTG ATCAAAGAAA TTTAITACCT TCAGTTTCAA TGGAAATAAT 
1381 TACTGATAAT ACAAACATGT GTGAACACAC ACTAATCCTA TCCAAATGCA CAGTGATACA 
1441 CAGAAAATAT TAGCAAGTAG AATGCAATAT TTATATAACG ATTGTATTTA TCAATCAATT 
1501 GTATGTATCA ATATATGGGC TATTTTCTTA CACATX3ATTT TATTCAAATT TACTCTAATC 
1561 ATTGTTGAAC CATTTAGAAA AGGCATACTG GCAACTTTTC CTTACCTCAT CCAGCTtSGGC 
1621 AAAAGTCCCA GTGTGGAGTA AAGGATGCAA GATTTCCTGC TCTGTTAAGT ATAAAAIAAT 
16B1 AGTATGAATT CAAAGGTGCC ATTCTTCTGC TTCTAGTTAT AAAGGCAGTG CTTGCTTCTT 
1741 CCAGCACAGA TCTGGATCTC GAGGAGCTTG GCGAGATTTT CAGGAGCTAA GGAAGCTAAA 
1801 AGCCGCCACC ATGAAAGCCA TCTTAATCCX: ATTTTTATCT CTTCTGATTC CGTTAACCCC 
1861 GCAATCTGCA TTCGCTCAGA GTGAGCCGGA GCTGAAGCTG GAAAGTCTGG TGATTGTCAG 
1921 TCGTCATGGT GTGCGTGCTC CAACCAAGGC CACGCAACTG ATGCAGGATG TCACCCCAGA 
1981 CX5CATGGCCA ACCTGGCXGG TAAAACTGGG TTGGCTGACA CXGCGCGGTG GTGAGCTAAT 
2041 CGCCTATCTC GGACATTACC AACGCCAGCX3 TCTGGTAGCC GACGGATXGC TGGCGAAAAA 
2101 GGGCTGCX:CG CAGTCTGGTC AGGTCGCGAT TATTGCTGAT GTCGACGAGC GTACCCGTAA 
2161 AACAGGCGAA GCCTTCGCCG CCGGGCTGGC ACCTGACTGT GCAATAACCG TACATACCCA 
2221 GGCAGATACG TCCAGTCCCG ATCCGTTATT TAATCCTCTA AAAACTGGCG TTTGCCAACT 
2281 GGATAACGCG AACX5TGACTG ACX3CGATCCT C3U3CAGGGCA GGAGGGTCAA TTGCTGACTT 
2341 TACCGGGCAT CGGCAAACXIG CGTTTCGCGA ACTGGAACGG GTGCTTAATT TTCCGOIATC 
2401 AAACTTGTGC CTTAAACGTG AGAAACAGGA CX5AAAGCTGT TCATXAACGC AGGCATTACC 
2461 ATCGGAACTC AAGGTGAGCG CCGACAATGT CTCATTAACC GGTGC3GGTAA GCCTCGCATC, 
2521 AATGCTGACG GAGATATTTC TCCTGCAACA AGCACAGGGA ATGCCGGAGC CGGGGTGGGG 
2581 AAGGATCACC GATTCACACC. AGTG6AACAC CTTGCTAAGT TTGCATAACG CGCAATTTTA 
2641 TTTGCTACAA CGCACGCCAG AGGTTGCCCG CAGCCGCGCC ACCCCGTTAT TAGATTTGAT 
2701 CAAGACAGCG TTGACGCCCC ATCCACCGCA AAAACAGGCG TATGGTGTGA CATTACCCAC 
2761 TTCAG1X3CTG TTTATCGCCG GACACGATAC TAATCTGGCA AATCTCGGCG GC6CACTGGA 
2821 GCTCAACTGG ACGCTTCCCG GTGAGCCGGA TAACACGCCG CCAGGTGGTG AACTGGTGTT 
2881 TGAAOGCTGG CGTQGGCTAA GCGATAACAG CCA(3TGGATT CAGGTTTCGC TGGTCTPCCA 
2941 GftCTTTACAG CAGATGCGTG ATAAAACGCC GCTGTCATTA AATACGCOGC CCGOAGAGGT 
3001 GAAACT6ACC CTGGCAGGAT GTGAAGAGCG AAATGCGCAG GGCATGT6TT CGTTGGCAGG 
3061 TTTXACGCAA ATCGTGAATG AAGCACGCAT ACCCGCTTGC AGTTTGTAAG GTATAAGGCA 
3121 6TTATTGGTG CCCTTAAACG CCTGGTGCTA CGCCTGAATA AGTGATAATA AGCGGATGAA 
3181 TGGCAGAAAT TCGCCGGATC TTTGTGAAGG AACCTTACTT CTGTGGTGTG ACATAATTGG 
3241 ACAAACXACC TACAGAGATT TAAAGCTCTA AGGTAAATAT AAAATTTTTA AGTGTATAAT 
3301 GTCTTAAACT ACTGATTCTA ATTGTTTGTG lATTTTAGAT TCCAACCTAT GGAACTCATG 
3361 AATGGGAGCA GTGGTGGAAT GCCTTTAATG AGGAAAACCT GTTTTGCTCA GAAGAAATGC 
3421 CATCTA6TGA TGATGAGGCT ACTGCTGACT CTCAACATTC TACTCCTCCA AAAAAGAAGA 
3401 GAAAGGTAGA A6ACCCCAAG GACTTTCCTT CAGAATTGCT AAGTTTTTTG AGTCATGCTG 
3541 T6TTTAGTAA TAGAACTCTT GCTTGCTTTG CTATTTACAC CACAAAGGAA AAAGCTGCAC 
3601 TGCTATACAA 6AAAATTAIG QAAAAATATT CTGTAACCTT TATAA0TAGG CATAACAGTT 
3661 ATAATCATAA CATACTGTTT TTTCTTACTC CACACAGGCA TAGAGTGTCT GCTATTAATA 
3721 ACTATGCTCA AAAATTGTGT ACCTTTAGCT TTTTAATTTG TAAAGGGGTT AATAAGGAAT 
3781 ATTTGATGTA TAGTGCCTTG ACTAGAGATC ATAATCAGCC ATACCACATT TGTAGAGGTT 
3841 TXACTTGCrr TAAAAAACCT CCCACACCTC CCCCTGAACC rTGAAACATAA AATGAMGCA 
3901 ATTGTTGTTG TTAACTTGTT TATTGCAGCT TATAATGGTT ACAAATAAAG CAATAGCATC 
3961 ACAAATTTCA CAAATAAAGC ATTTTTTTCA CTGCATTCTA GTTGTGGTTT GTCCATIACTC 
4021 ATCAATGTAT CTTATCATGT CTGGATCGAT CCCCGGGTAC CGAGCTCGAA TTCGTAATCA 
4081 TGGTCATAGC TGTTTCCTGT GTGAAATTGT TATCCGCTCA CAATTCCACA CAACATACGA 
4141 GCCGGAAGCA TAAAGTGTAA AGCCTGGGGT GCCTAATGAG TGAGCTAACT CACATTAATT 
4201 GCGTTGCGCT CACTGCCCGC TTTCCAGTCG GGAAACCTGT CGTGCCAGCT GCATTAATGA 
4261 Xtcggccaac GCGCGGGGAG AGGCGGTTTG CGTATTGGGC GCTCTTCCGC TTCCTCOCTC 
4321 actgactcgc tgCgctcggt CGTTCGGCTG CGGCGAGCGG TATCAGCTCA CTCAAAGGCG 

33/58 



RECTIFIED SHEET (RULE 91) 



wo 00/64247 



PCT/CAOO/00430 



Figure 18 (continued): 

4381 GTAATACGGT TATCCACAGA ATCAGGGGAT AACGCAGGAA AGAACATGTG AGCAAAAGGC 
4441 CAGCAAAAGG CCAGGAACCG TAAAAAGGCC GCGTTGCTGG CGTTTTTCCA TAGGCTCCGC 
4501 CCCCCTGACG AGCATCACAA AAATCGACGC TCAAGTCAGA GGTGGCGAAA CCCGACAGGA 
4561 CTATAAAGAT ACCAGGCGTT TCCCCCTGGA AGCTCCXTTCG TGCGCTCTCC T6TTCCGACC 
4621 CTGCCGCTTA CCGGATACCT GTCCX5CCTTT CTCCCTTOGG GAAGCGTGGC GCTTTCTCAA 
4681 TGCTCACGCT GTAGGTATCT CAGTTCGGTG TAGGTCGTTC GCTCCAAGCT GGGCTGTGTG 
4741 CACGAACX:CC CCGTTCAGCC CGACCGCTGC GCCTTATCCG GTAACTATCG TCTTGAGTCC 
4801 AACCCGGTAA GACACXSACTT ATC3GCCACTG GCAGCAGCCA CTGGTAACAG GATTAGCAGA 
4861 GCX3AGGTATG TAGGCGGTGC TAC31GAGTTC TTGAAGTGGT GGCCTAACTA CGGCTACACT 
4921 AGAAGGACAG TATTTGGTAT CTGCGCTCTG CTGAAGCCAG TTACCTTCGG AAAAAGAGTT 
4981 GGTAGCTCTT GATCCGGCAA ACAAACCACC GCTGGTAGCG GTGGTTTTTT TGTTTGCAAG 
5041 CAGOVGATTA CGCGCAGAAA AAAAGGATCT CAAGAAGATC CTTTGATClT TTCTACGGGG 
5101 TCTGACGCTC AGTGGAACGA AAACTCACGT TAAGGQATTT TGGTCATGAG ATTATCAAAA 
5161 AGGATCTTCA CCTAGATCCT TTTAAATTAA AAATGAAGTT TTAAATCAAT CTAAAGTATA 
5221 TATGAGTAAA CTTGGTCTGA CAGTTACCAA TGCTTAATCA GTQAGGCACC TATCTCAGCG 
5281 ATCTGTCTAT TTCGTTCATC CATAGTTGCC TGACTCCCCG TCGTGTAGAT AACTACGATA 
5341 CGGGAGGGCT TACCATCTGG CCCCAGTGCT GCAATGATAC CGCGAGACCC ACGCTCACCG 
5401 GCTC CAGAT T TATCAGCAAT AAACCAGCCA GCCGGAAGGG CCGAGCCCAG AAGTGGTCCT 
5461 GCAACTTTAT CCGCCTCCAT CCAGTCTATT AATTGTTGCC GGGAAGCTAG A6TAAGTAGT 
5521 TCGCCAGTEA ATAiGTTTGCXS CAACGTTGTT GCCATTGCTA CAGGCATCGT GGTCTCAOSC 
5581 TCGTCGTTTG GTATGGCTTC ATTCAGCTCX: GGTTCCCAAC GATCAAGGCG AGTTACATGA 
5641 TCCCCCATGT TGTGCAAAAA AGCGGTTAGC TCCTTCGGTC CTCCX3ATCGT TGTCAGAAGT 
5701 AAGTTGGCCG GAGTCtTTATC ACTCATGGTT ATGGCAGCAC TGCATAATTC TCTTACTGTC 
5761 ATGCCATCCG TAAGATGCTT TTCTGTGACT GGTGAGTACT CAACCAAGTC ATTCTGAGAA 
5821 TAGTGTATGC GGCGACCGAG TTGCTCTTGC CCGGCGTCAA TACGGGATAA TACCGCGOCA 
5881 CATAGCAGAA CTTTAAAAGT GCTCATCATT GGAAAAlCQTT CTTCOGGGCG AAAACTCTCA 
594X AGGATCTTAC CGCT6TTGAG ATCXaUSTTCG ATGTAACCCA CTCGTX3CACC CAACTGATCT 
6001 TCAGCATCTT TTACTTTCAC CAGCGTTTCT GGGTGAGCAA AAACAGGAAG GCAAAATGCC 
6061 GCAAAARAGG GAATAAGGGC GACACGGAAA TGTTGAATAC TCATACTCTT CCTTTTTCAA 
6121 TATTATTGAA GCATTTATCA GGGTTATTGT CTCATGAGCG GATACATATT TGAATGTATT 
6181 TAGAAAAATA AACAAATAGG GGTTCCGCGC ACATTTCCCC GAAAAGTGCC ACCTGACGTC 
6241 TAAGAAACCA TTATTATCAT GACATTAACC TATATUIAATA GGCGTATCAC GAGGCCCTTT 
6301 CX5TCTCGCGC GTTTCGGTGA TGACX5GTGAA AACCTCTGAC ACATGCAGCT CCCX5GAGA0G 
6361 CTCACAGCTT GTCTGIAA6C GGATGCXXSGG A6CAJ8ACAAG CCOGTCAGGG CGCGTCA6C6 
6421 GGTOrrGGCG G6TGT0G6GG CTGGCTXAAC TATGOGGCAT CAGAGCAGAT TGTACTGAGA 

6481 gtgcaccata tgcggtgtga aataccgcac agatgcx5taa ggagaaaata ccgcatcagg 
6541 cgcx:attcgc cattcaggct gcgcaactgt tgggaagggc gatcggtgcg ggcctcttcg 
6601 ctattacx3cx: agctggcgaa agggggatgt gctgcaaggc gattaagttg ggtaacgcca 
6661 gggttttccc agtcacgacg txx?taaaacxs acggccagtg ccaagctt 
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Figttre 19: Nucleic acid sequence of the known segment of the RlS/appa+iatron transgene 
used for the generation of transgenic mice fSEO ID NO: 3\ 

LOCUS R15/appa 4060 bp DNA SYN 15-APR-2000 

DEFINITION R15/appa transgene without vector 

ACCESSION RlS/appa 

REFERENCE 1 (bases 1 to 4060) 

SOURCE synthetic construct. 

ORGANISM synthetic construct 
artificial sequence. 
KEYWORDS salivary proline -rich protein, acid glucose- 1 -phosphatase; appA 

gene; periplasmic phosphoanhydride phosphohydrolase; artificial 

sequence; 

AUTHORS Golovcui, S. , Forsberg, C.W., Phillips, J. 
JOURNAL Unpublished. 

DEFINITION Rat salivary proline-rich protein (RP15> gene. 
ACCESSION M64793 H36414 
VERSION M64793,l 01:206711 

SOURC^ Rat (Sprague-Dawley) liver DNA. 

ORGANISM Rattus norvegicus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; 

Mammalia ; 

Butheria; Rodentia; Sciurognathi ; Muridae; Murinae; 

Rattus . 

REFERENCE 1 (bases 1 to 1748) 
AUTHORS Lin,H.H. and Ann^D.K. 

TITLE Molecular characterization of rat multigene family 
encoding 

proline-rich proteins 
JOURNAL Genomics 10, 102-113 (1991) 
MEDLINE 91257617 
FEATURES Location/Qualifiers 
source 1. .1748 

yorganiBro= *Rattus norvegicus* 
/strain= •* Spxague -Dawley " 
/db_xref = " taxon : 1 0116 
/ tissue_type= " liver" 
/tissue_lib="cosmid genomic library" 
raisc_feature 1802-1810 

/functions" consensus sequence for initiation in 
higher euJcaryotes * 

FEATURES location/Qualifiers 

DEFUIITION E. coli periplasndc phosphoanhydride phosphohydrolase (appA) 
gene, 

ACCESSION M5870B L03370 L03371 Ii03372 L03373 L03374 L03375 

VERSION MSe708.1 GI: 145283 

SOXJRCE Escherichia coli DNA. 

ORGANISM Escherichia coli 

Bacteria; Proteobacteria; gamma subdivision; 
Enterobacteriaceae ; 

Escherichia. 

REFERENCE 1 (bases 1811.3109) 

AUTHORS Dassa,J., Marck,C. and Boquet,P.L. 
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Fignre 19 fcontinued): 

TTTT.Tg The complete nucleotide sequence of the Escherichia coli 
gene appA reveals significant homology between pH 2.5 
acid phosphatase and glucose-1 -phosphatase 

JOURNAL J. Bacteriol. 172 (9), 5497-5500 (X990) 

MEDLINE 90368616 



FEATURES 

Source 



sig_peptide 
/genes *appA" 



CDS 



Location/Qualif iers 

1811- ,3109 
/organisms "Escherichia coli" 
/db_xref =" taxon : 562 " 
18ir . 1876 

1811. ,3109 

/gene="appA'* 

/ standard_naine= " acid phosphatase /phytase " 
/ trans l_t able* 1 1 

/products "periplasmic phosphoanhydrlde 

phosphohydrolase •* 

/pro tein_id« « AAA7 2086.1" 

/db_xref="GI:14528S« 

/translation="MKAILIPFLSLLIPLTPQSAFAQSEPELKLESWIVSRHGVRAP 

TKATQLMQDVTPDAWPTWPViaXJWLTPRGGELIAYI/anfQRQRLVArK^^ 

GQVAIIADVDERTRKTGEAFAAGLAPDCAIT\mTQADTSSPDPLFinPLKTGfVCQLDl^ 

NVTDAILSRAGGSIADFTGHRQTAFRELERVLNFPQSNLCLKREKQDESCSLTQALPS 

ELKVSADNVSLTGAVSIiAShOjTEIFLLQQAQCaiPEPGWGRITDSHQWNTLLSLm^ 

YLIiQRTPEVARSRATPLLDLIKTALTPHPPQKQAYGVTLPTSVLFIAGHiyn^^ 

MiELWirrLPGQPDNTPPGGELVFERWRRLSmSQWIQVSLVFQTLQQMR^ 

PPGEVKLTLAGCEERllAQGMCSLAGFTQIVNBARIPACSL" 

iiiatj>ept:ide 1877 3106 

/genea*appA" 

/products •periplastaic phosphoanhydrlde 
phosphohydrolase ** 

mutation replace (1817. . 1819, "gcg changed to gcc-) 
/genee"appA" 

/staiidard_jaames*A3 mutant" 

/note= "created by site directed mutagenesis" 
/phenotypes'* silent mutation" 
rautatiop replace (3092. .3094," ccg changed to ccc") 
/gene«"appA" 

/standard_name=" P428 mutant" 
/note=" created by site directed mutagenesis" 
/phenotype=" silent mutation " 
mutation replace (3095. .3097," gcg changed to get") 
/genec"appA" 

/standard^names" A429 mutant" 

/note=" created by site directed mutagenesis'* 

/phenotypes" silent mutation " 
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Fif ure 19 (continued): 

SV4 0 t intron 
polyA_signal 



3197.-3810 

/note«"SV40 signals'* 
3807. .4047 

/note="SV40 signals" 



BASE COUNT 1257 a 814 C 843 g 1146 t 

ORIGIN 

1 GGATCCCCTT TGCTATGTAG TTTTTAATGG AAATTACAAC CCATAGTGTG TTGATAAATA 
61 GAGAGTCCTG TTTGGTTTAA GCAACCTCTG TTTCTCATAA ACTCCATAAA AACAGGAATA 
121 CTCTTTGTTT CTAGCATAAC CAAAAGATTT AGTOAAT-IGA AAACAATGTT CCCTTAGAGT 
181 ATAGGTCTAA TAACCCCGAA AATATTACCA TGATACTGAG CATTTGTAAG TATCTCATAG 
241 CATGTAGTAT CCATAGTCCA TCAATGAGAG AGACATTTAA CATGATTTTC ATTAATCAGG 
301 TGGAAAAGAC ATGACAACAT TCACAGGCAC TGCACAGAAC ATAGTGGTCC ACCTTGCACA 
361 TATTTCACTA AACTAGGTTT ATCTATTTTG TTGCTTTCTC TAACATCTCT GCAATGAAGC 
421 AGGTCAACAG TGCCACATAT CCTTTACTTA ACCTAAGGAA CACAAAAAAT TTTCTACATA 
481 TATCCTGGTT AGAGAGTGCT TAAAATAAGT TTTCCAAGAA TGGAAARGAA ATCTTCTGAC 
S41 TTAACAATTA AGACAGTATT TATTTAAAGC AAGAAATATG AGGCACACAA GAAAATATTT 
€01 TGGGAAGAAA CCATTTGGTG AACAATATTT CAAAXAAAAA TAGACAAACA TAGTTAATTG 
661 TAAAACATAT GTTTCACCAG CCCTTCTTTT CAATAGGCTT AATGTGAATA AAATGTTAAA 
721 GATTCTCTTT GGGTGGCTGC AAATTGTCCA CGAATAAGAC AAAATATAAA AATAAGGACT 
781 GAGTCTCACA AAATGAAAAG GAAATATATT CAGAAAGAGA ATCTTGAGAG AATGTGTTGT 
841 CACAAATTAA AGAAAACCTG TGGTGAATGA CATCCTGAGG CCTGAGCTAT TACTGACATT 
901 TAAGATAAAG GTAACTGTAT ACATTTGTCC CATTGAGGGG ACAAGAAAGC TGCTCTCATG 
961 TTCAGCTCTA TAATTCTTGC CTTAAACAAC TTAAATAGAA TGATTTAAAA XATGGA6CTG 
1021 TCCATGGACC TTTGAAATAT AAAATAGTCA AGCAACTTAT CAAGGAATXA CA GATTC CTT, 
1081 GATACTAACA CAGGTAAATC CCACACGTGT TTTOAGACTA CATTTGCTGG GATTTTATTG 
1141 ATGTAATAGG TCACATGTTT TTCGGGCCAA TGTTGCTGTT ATTCGGTTAC TTCAAGAGAA 
1201 TAGIGGCAAC TGATGCTATG TATTCTAGGG GTTTGAAGTG ATGTTTCATG ATTGAAATTT 
1261 GTAAAAGAAT AACATCATCA TTCTTAACAA TAGAACATAT AAA GTCA CAC AGAAGTGACA 
X321 GTGTTTAAGC TGTACTATTG ATCAAAGAAA TTTATTACCT TCAGTTTCAA TGGAAATAAT 
1381 TACIGATAAT ACAAACAIGT GTGAACACAC ACTAATCCTA TCCAAATGCA CAGTGATACA 
1441 CAGAAAAXAT TAGCaAGTAG AATGCAA3AT TTATATAAC3G MTGTftXTTA TCAATCAATT 
1501 GTATGTATCA ATATATGGGC TATTTTCTTA CACATGATTT TATTCAAAaT TACTCTAATC 
1561 ATTGTTGAAC CATTTAGAAA AGGCATACTG GCAACTTTTC CTTACCTCAT CCAGCTGGGC 
1621 AAAAGTCX:CA GTGTGGAGTA AAGGATGCAA GATTTCCTGC TCTGTTAAGT ATAAAATAAT 
1681 AGTATGAATT CAAAGGXGCC ATTCTTCTGC TTCTAGTTAT AAAGGCAGTG CTTGCTTCTT 
1741 CCAGCACAGA TCTGGATCTC GAGGAGCTTG GCGAGATTTT CAGGAGCTAA GGAAGCTAAA 
1801 AGCCGCCACC ATGAAAGCCA TCTTAATCCC ATTTTTATCT CTTCTGATTC CGTTAACCXIC 
1861 GCAATCXGCA TTCGCTCAGA GTGAGCCGGA GCTGAAGCTG GAAAGTGTGG TGATTGTCAG 
1921 TCGTCATGGT GTGC6TGCTC CAACCAAGGC CACGCAACTG ATGCAGGATG TCACCCCAGA 
1981 CGCATGGCCA ACCTGGCCGG TAAAACTGGG TT6GCTGACA CCGCGCGGXG GTGAGCTAAT 
2041 aSOHMCTC GGACATTACC AACGCCAGCG TCTGGTAGCC GAOSGATTGC T6GCGAAAAA 
2101 GGGCT6CCCG CAGTCTGGTC AGGTCGCGAT TATTGCTGAT GTCGACX3AGC GTACCCGTAA 
2161 AACAGGC3GAA GCCTTCGCCG CCGGGCTGGC ACCTOACTGT GCAAIAACCX5 TACATACCCA 
2221 GGCAGATACG TCCAGTCCCG ATCCGTTATT TAATCCTCTA AAAACXGGCX5 TTTOCCAACT 
2281 GGATAACGCG AACGTGACTG ACGCGATCCT CAGCAGGGCA GGAGGGTCAA TTGCTGACTT 
2341 TACCX3GGCAT CGGCAAACGG CGTTTCGCGA ACTGGAACGG GTGCTTAATT TTCCGCAATC 
2401 AAACTTGT6C CTTAAACXSTG AGAAACAGGA CX3AAAGCTGT TCATTAACX5C AGGCATTACC 
2461 ATCGGAACTC AAGGTGAGCG CCGACAATGT CTCATTAACC GGTGOGGTAA GCCTCGCATC 
2521 AATGCTGACX3 GAGATATTTC TCXnXSCAACA AGCACAGGGA ATGCXXK3AGC CGGG GTGGGG 
2581 AAGGATCACC GATTCACACC AGTGGAACAC CTTGCTAAGT TTGCMAACG CGCAATTTTA 
2641 TTTGCTACAA CGCACGCCAG AGGTTGCCCG CAGCCGCGCC ACCTCGTTAT TAGATTTGAT 
2701 CAAGACAGCG TTQACGCCCC ATCCACCGCA AAAACAGGCG TATGGTGTGA CATTACCXJIC 
2761 TTCAGTGCTC TTTATCGCCG GACACGATAC TAATCTGGCA AATCTCGGCX3 GCGCACTGGA 
2821 esCTCAACTGG ACGCTTCCCG GTGAGCCGGA TAACACGCCG CCA GGTGG TG AACTGGTGTT 
2881 TGAACGCTGG CGTCGGCTAS. GCGATAACAG CCAGTGGATT CAGGTTTCGC TGGTCTTCCA 

37/58 



RECTIFED SHEET (RULE 91) 



wo 00/64247 



PCT/CAOO/00430 



Figure 19 fcontmued): 

2941 GACTTTACAG CAGATGCGTG ATAAAACGCC GCTGTCATTA AATACGCCGC CCGGAGAGGT 
3001 GAAACTGACC CTGGCAGGAT GTGAAGAGCG AAATGCGCAG GGCATGTGTT CGTTGGCAGG 
3061 TTTTACGCAA ATCGTGAATG AAGCACGCAT ACCCGCTTGC AGTTTGTAAG GTATAAGGCA 
3121 GTTATTGGTG CCCTTAAACG CCTGGTGCTA CGCCTGAATA AGTGATAATA AGC6GATGAA 
3181 TGGCAGAAAT TCGCCGGATC TTTGTGAAG6 AACCTTACTT CTGTGGTGTG ACATAATTGG 
3241 ACAAACTACC TACAGAGATT TAAAGCTCTA AGGTAAATAT AAAATTTTTA AGTGTATAAT 
3301 GTGTTAAACT ACTGATTCTA ATTGTTTGTG TATTTTAGAT TCCAACCTAT GGAACTGATG 
3361 AATGGGAGCA GTGGTGGAAT GCCTTTAATG AGGAAAACCT GTTTTGCTCA GAAGAAATGC 
3421 CATCTAGTGA TGATGAGGCT ACTGCTGACT CTCAACATTC TACTCCTCCA AAAAAGAAGA 
3481 GAAAGGTAGA AGACCCCAAG GACTTTCCTT CAGAATTGCT AAGTTTTrTG AGTCATGCTG 
3541 TQTTTAQTAA TAGAACTCTT GCTTGCTTTG CTATTTACAC CACAAAGGAA AAAGCTGCAC 
3601 TGCTATACAA GAAAATTATG GAAAAATATT CTGTAACCTT TATAAGTAGG CATAACAGTT 
3661 ATAATCATAA CATACTGTTT TTTCTTACTC CACACAGGCA TAGAGTGTCT GCTATTAATA 
3721 ACTATGCTCA AAAATTGTGT ACCTTTAGCT TTTTAATTTG TAAAGGGGTT AATAAGGAAT 
3781 ATTTGATGTA TAGTGCCTTG ACTAGAGATC ATAATCAGCC ATACCACMT TGTAGAGGTT 
384X TTACTTGCTT TAAAAAACCT CCCACACCIC CCCCTGAACC TGAAACATAA AATGAATGCA 
3901 ATTGTT6TTG TXAACTTGTT TATTGCAGCT TATAATGGTT ACAAATAAAG CAATAGCATC 
3961 ACAAATTTCA CAAATAAAGC ATTTTTTTCA CTGCATTCTA 6TTGT66TTT GTCCAAACTC 
4021 ATCAATGTAT CTTATCATGT CTGGATCGAT CX!CCGGGTAC 

// 
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Figure 20: Nucleic acid sequence of the known segment of the RlS/appa plasmid rincluding 
the vector sequences of dBLCAT3 (SEP ID N0:4V 



LOCUS RlS/appa 6116 bp DNA SYN lS-APR-2000 

DEFINITION RlS/appa traiiisgene with vector 

ACCESSION RlS/appa 

REFERENCE 1 (bases 1 to 6116) 

SOURCE synthetic construct . 

ORGANISM synthetic construct 
artificial sequence, 
salivary proline -rich protein, acid glucose-1 -phosphatase; appA 
gene; periplasraic phosphoanhydride phosphohydrolase ; artificial 
sequence ; 

Golovan« S., Forsberg, C.n., Phillips, J. 
Unpublished. 



KEYWORDS 



AUTHORS 
JOURNAL 



DEFINITION Rat salivary proline-rich protein (RP15) gene. 
ACCESSION M64793 M36414 
VERSION M64793-1 GI: 206711 

SOURCE Rat (Sprague-Dawley) liver DNA. 

ORGANISM Rattus norvegicus 

Eukaryota; Metazoa; Chorda ta; Craniata; Vertebrata; 

Eutheria; Rodent ia; Sciurognatbi; Muridae; Murinae; 

1 (bases 1 to 1748) 
Lin,H.H. and Ann,D.K. 

Molecular characterization of rat multigene family 



Mammalia; 

Rattus . 

REFERENCE 

AUTHORS 

TITLE 
encoding 

JOURNAL 
MEDLINE 
FEATURES 

source 



miec feature 



proline-rich proteins 
Genomics 10, 102-113 (1991) 
91257817 

Loca t ion/Qual i f i er s 
1. -1748 

/ organisms: »Rattus norvegicus " 
/strains "Sprague-Dawley* 
/db^xref^^taxon: 10116" 
/ 1 is sue_type= " 1 iver *• 
/tissue_lib="cosmid genomic library" 

1802-1810 

/function*" consensus sequence for initiation in 
higher eukaryotes * 



FEATURES Location/ Qualifiers 

DEFINITION E. coli periplasmic phosphoazxhydride phosphohydrolase (appA) 
gene, 

ACCESSim MS8708 L03370 L03371 L03372 L03373 L03374 L0337S 
VERSICMI MS8708.1 GI -.145283 

SOURCE Escherichia coli DNA. 

ORGANISM Escherichia coli 

Bacteria; Proteobacteria; gamoa subdivision; Enterobacteriaceae ; 

Escherichia. 

REFERENCE 1 (bases 1811..3109) 

AUTHORS Dassa,J., Marck,C. and Boquet,P.L. 

TITLE The complete nucleotide sequence of the Escherichia coli gene appA 
reveals significant homology between £iH 2.5 acid phosphatase 
and glucose- 1 -phosphatase 

JOURMAL J. Bacteriol. 172 (sT. S497-S500 (1990) 
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Figare 20 (continued): 

MEDLIHE 9036B616 

FEATURES 

Source 



sig_peptide 

/genea"appA*' 

CDS 



Location/Qualifiers 

1811. .3109 
/organism- "Escherichia coli" 
/ db^xref = " t axon : 5 6 2 " 
1811,. 1876 

1811. ,3109 

/gene=*'appA" 
/standard_name=''acicl phosphatase/phytase" 
/transl_table=ll 

/ produc t = "peripl asmic phosphoaivhydride phosphohydrolase • 

/protein_id="AAA72086.1» 

/db_xref-"GI: 145285'* 

/ 1 ranslat ion= "MKAILIPFI^LIilPLTPQSAFAQSEPELKLESVVrVSRHGVRAP 

7KATQLMQDVTPDAWPTWPVKLG»I#TPRGGELIAYIiGHYQRQR^ 

GQVAXXADVDERTRKIGEAPAAGLAPDCAITVHTQADTSSFDPL^ 

NVTDAtIiSRAGGSXADFTG«RCn'AFR£LERVLKFPQSMLCLKREKQDESCSLTQ^ 

ELKVSADNVSLTGAVSIASMLTEIFLLQQAQGMPEPGHGRITDSHOKin^ 

nJORTPEVARSRATPLU)LZKTALTPHPPQKQAY6VTLPTSVIiFIAG3IDT^^ 

AI£LKWTLPGQPDin!PFGK3ELVFERWRRI^mSQVaQVSLW 

PPGEVKLTIAGCEERNAQGMCSLAGFTQrVNEARIPACSL • 
inat_peptide 1877 3106 

/gene^^appA" 

/products "periplasmic phosphoanhydride phosphohydrolase « 



mat at ion 



mutatxon 



nutatxon 



replace (1817 - . 1819, "gcg changed to gcc") 
/gene=*appA" 

/standard^naine^i^AS noitant* 

/notes "created by site directed mutagenesis* 

/phenotype=* silent mutation* 

replace (3092. .3094, " ccg changed to ccc*) 

/gene="appA'' 

/standard_name-" P428 mutant" 

/notes ^created by site directed mutagenesis" 

/phenot:ypes" silent mutation " 

rqplace(3095..3097, " gcg changed to .get") 

/genee"appA" 

/standardjoamee" A429 mutant* 

/iiot:e»** created by site directed mutagenesis* 

/phenotype=" silent mutation • 



DEFINITION Plasraid pBLCAT3 (basQS 3109 to 6116) 



ACCESSION X64409 

VERSION X64409.1 61:58163 

SOURCE synthetic construct. 

ORGANISM synthetic construct 
artificial sequence - 
REFERENCE 1 (bases 3109 to 6116) 

AUTHORS Luckow,B.H.R. 

TITLE Direct Submission 

JOURNAL Submitted (06-FEB-1992) B-H.R. Luckow, German <::ancer Res 
Center, Im Neuenhei/tier Feld 280, W-6900 Heidelberg, ?RG 
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Figure 20 fcontmued): 



REFERENCE 
AXTTHORS 
TITLE 

for 

regulatory 

JOURNAL 
MEDLINE 
COMKENT 
experimeats 



FEATURES 

source 



polyA_signal 



2 (bases 3109 to €116) 
Luckow,B. and Schutz,G. 

CAT constructions with ntultiple unique restriction sites 
the functional analysis of eukaryotic promoters and 
elements 

Nucleic Acids Res. IS (13), S490 (1987) 
87260024 

Promoterless CAT vector for transient transfection 

with eukaryotic cells. Allows the analysis of foreign 
promoters and enhancers , 

Location/ Qualifiers 
3109 to 6116 

/organiEm=« synthetic construct" 
/db_xref = taxon : 3 2 63 0 • 
3262. ,3457 

/note«"SV40 signals" 



CDS cotr^lement (4654 . «5514) 

/codon_start=l 
/transl_table«ll 
/gene="An^" 

/product = "beta - lactamase " 
/protein_id="CAA457S3 • 1" 
/dbjxref ="GI : S 81 65 " 



BASE COUNT 1724 a 1386 c 1407 g 1599 t 

ORIGIN 

1 GGATCCCCTT TGCTATGTAG TTTTTAATGG AAATTACAAC CCATAGTGTG TTGATAAATA 
61 GAGAGTCCTG TTTGGTTTAA GCAACCTCTG TTTCTCATAA ACTCCATAAA AACAGGAATA 
121 CTCTTTGTTT CTAGCATAAC CAAAAGATTT AGTGAATTGA AAACAATGTT CCCTTAGAGT 
181 ATAGGTCTAA TAACCCOGAA AATATTACCA TGATACTGAG CATTTGTAAQ TATCTCATAG 
241 CATGTAGTAT CCATAGTCCA TCAATQAGAG AGACATTTAA CATOATTTTC ATTAATCAGG 
301 TGGAAAAGAC ATGACAACAT TCACAGGCAC TGCACAGAAC ATAGTGGTCC ACCTTGCACA 
361 TATTTCACTA AACTAGGTTT ATCTATTTTG TTGCTTTCTC TAACATCTCT GCAATGAAGC 
421 AGGTCAACAG TGCCACAIAT CCTTTACTTA ACCTAAGGAA CACAAAAAAT TTTCTACATA 
481 TATCCTGGTT AGAGAGTGCT TAAAATAAGT TTTCCAAGAA TGGAAAAGAA ATGTTCTGAC 
541 TTAACAATTA AGACA6TATT TATTTAAAGC AAGAAATATG AGGCACACAA GA AAAT ATTT 
601 TCGGAAGAAA CCATTTGGTG AACAATATTT CAAATAAAAA TAGACAAACA TAGTCAATTG 
661 TAAAACATAT 6TTT6ACCAG CCCTTCTTTT CAATAGGCTT AATGTGAATA AAATGTTAAA 
721 GATTCTCTTT GGGTGGCTGC AAATTGTCCA CGAATAAGAC AAAATATAAA AATAAGGACT 
781 GAGTCTCACA AAATGAAAAG QAAATATATT CAGAAAGAGA ATCTTGAGAG AATGTGTT6T 
841 CACAAATTAA AGAAAACCTG TGGTGAATGA CATCCTGAGG CCTGAGCTAT TACTGACATT 
901 TAAGATAAAG GTAACTGTAT ACATTTGTCC CATTGAGGGG ACAAGAAAGC TGCTCTCATG 
961 TTCAGCTCTA TAATTCTTGC CTTAAACAAC TTAAATAGAA TGATTTAAAA TATGGAGCTG 
1021 TCCATGGACC TTTGA2UVTAT AAAATAGTCA AGCAACTTAT CAAG(3AATTA CA GATTC CTT 
1081 GATACTAACA CAGGTAAATC CCACACGTGT TTTGAGACTA CATTTGCTGG GATTTTATTG 
1141 ATGTAATAGG TCACATGTTT TTCGGGCCAA TGTTGCTGTT ATTCGGTTAC TTCAAGAGAA 
1201 TAGTGGCAAC TGATGCTATQ TATTCTAGGG GTTTGAAGTG ATGTTTCATG ATTGAAATTT 
1261 GTAAAAGAAT AACATCATCA TTCTTAACAA TAGAACATAT AAAGTCACAC AQAAGTGACA 
1321 GTC3TTTAAGC TGTACTATTG ATCAAAGAAA TTTATTACCT TCAGTTTCAA TGGAAATAAT 
1381 TACTGATAAT ACAAACATGT GTGAACACAC ACTAATCCTA TCCAAATOCA CAGTGATACA 
1441 CAGAAAATAT TAGCAAQTAG AATGCAATAT TTATATAACG ATTGTATTTA TCAATCAATT 
1501 GTATGTATCA ATATATGGGC TATTTTCrTTA CACATGATTT TATTCAAATT TACTCTAATC 
1561 ATTGTTGAAC CATTTAGAAA AGGC^^TACTG GCAACTTTTC CTTACCTTCAT CCAGCTGGGC 
1621 AAAAGTCCCA GTGTGGAGTA AAGGSTGCAA GATTTCCTGC TCTGTTAAGT ATAAAATAAT 
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Figure 20 (continuedV 

168 L AGTATGAATT CAAAGGTGCX: ATTCTTCTGC TTCTAGTTAT AAAGGCAGTG CTTGCTTCTT 
1741 CCAGCACAGA TCTGGATCTC GAGGAGCTTG GCGAGATTTT CAGGAGCTAA GGAAGCTAAA 
1801 AGCCGCCACC ATGAAAGCCA TCTTAATCCC ATTTTTATCT CTTCTGATTC CGTTAACCCC 
1861 GCAATCTGCA TTCGCTCAGA GTGAGCCGGA GCTGAAGCTG GAAAGTGTGG TGATTXSTCAG 
1921 TCGTCATGGT GTGCGTGCTC CAACCAAGGC CACGCAACTG ATGCAGGATG TCACCCCAGA 
1981 CGCATGGCCA ACCTGGCCGG TAAAACTGGG TTGGCTGACA CCGCGCGGTG GTGAGCTAAT 
2041 CGCCTATCTC GGACATTACC AACGCCAGCG TCTGGTAGCC GACGGATTGC TGGCGAAAAA 
2101 GGGCTGCCCG CAGTCTGGTC AGGTCGCGAT TATTGCTGAT GTCGACGAGC GTACCCGTAA 
2161 AACAGGCGAA GCCTTCGCCG CCGGGCTGGC ACCTGACTGT GCAATAACCG TACATACCCA 
2221 GGCAGATACG TCCAGTCCCG ATCCGTTATT TAATCCTCTA AAAACTGGCG TTTGCCAACT 
2281 GGATAACGCG A^CGTGACTG ACGCGATCCT CAGCAGGGCA GGAGGGTCAA TTGCTGACTT 
2341 TACCGGGCAT CGGCAAACGG CGfrTCGCGA ACTGGAACGG GTGCTTAATT TTCCGCAATC 
2401 AAACTTGTGC CTTAAACGTG AGAAACAGGA CGAAAGCTGT TCATTAACGC AGGCATTACC 
2461 ATCGGTUVCTC AAGGTGAGCG CCGACAATGT CTCATTAACC GGTGCGGTAA 6CCTCGCATC 
2521 AATGCTGACG GAGATATTTC TCCTGCAACA AGCACAGGGA ATGCCGGAGC CGGGGTGGGG 
2581 AAGGATCACC GATTCACACC AGTGGAACAC CTTGCTAAGT TTGCATAACG CGCAATTTTA 
2641 TTTGCTACAA CGCACGCCAG AGGTTGCCCG CAGCCGCGCC AdCCCGTTAT TAGATTTGAT 
2701 CAAGACAGCG TTGACGCCCC ATCCACCX3CA AAAACAGGCG TATGGTGTGA CATTACCCAC 
2761 TTCAGTGCTG TTTATCGCCG GACACGATAC TAATCTGGCA AATCTCGGCG GCGCACTGGA 
2821 GCTCAACTGG ACX3CTTCCCG GTCAGCCXK3A TAACACGCCG CCAGGTGGTG AACTGGTGTT 
2S81 TGAACGCTGG CGTCGGCTAA GCGATAACAG CCAGTGGATT CAGGTTTCGC TGGTCTTCCA 
2 941 GACTTTACAG CAGATGCGTG AIAAAACGCC GCTGTCMrTA AATACGCCGC CCGGAGAGGT 
3001 GAAACTGACC CTGGCAGGAT GTGAAGAGCG AAATGCGCAG GGCATGTGTT CGTTGGCAGG 
3061 TTTTACGCAA ATCGTGAATG AAGCACGCAT ACCCGCTTGC AGTTTGTAAG GTATAAGGCA 
3121 GTTATTGGTG CCCTTAAACG CXTTGCTGCTA CGCCTGAATA AGTGATAATA AGCGGATGAA 
3181 TGGCAGAAAT TCGCCGGATC TTTGTGAAGG AACCTTACTT CTGTGGTGTG ACATAATTGG 
3241 ACAAACTACC TACAGAGATT TAAAAAACCT CCCACACCTC CCCCTGAACC TGAAACATAA, 
3301 AATGAATGCA ATTGTTGTTG TTAACTXGTT TAITGCAGCT TATAATGGTT ACiU^TAAAG 
3361 CAATAGCATC ACAAATTTCA CAAATAAAGC ATTTTTTTCA CTGCATTCTA GTTGTGGTTT 
3421 GTCCAAACTC ATCAATGTAT CTTATCATGT CTGGATCGAT CCCCGGGTAC CGAGCTCGAA 
3481 TTCGTAATCA TGGTCATAGC TGTTTCCTGT GTGAAATTGT TATCCGCTCA CAATTCCACA 
3541 CAACATACGA GCCGGAAGCA TAAAGTGTAA AGCCTGGGGT GCCTAATGAG TGAGCTAACT 
3601 CACATTAATT GCGTTGCGCT CACTGCCCGC TTTCCAGTCG GGAAACCTGT CGTGCCAGCT 
3661 GCATTAAIGA ATC3GGCCAAC GCGCGGGGAG AGGCX3GTTTG CQTATTGGGC GCTCTTCOGC 
3721 TTCCTCGCTC ACTGACTOGC TGCGCTCGGT CGTTCGGCTG CGGCGAGCGG TATCAGCTCA 
3781 CTCAAAGGCG GTAATACGGT TATCCACAGA ATCAGGGGAT AACGCAGGAA AGAACIVTGTG 
3841 AGCAAAAGGC CAGCAAAAGG CCAGGAACCG TAAAAAGGCC GOGTTGCTGG CG'iTlTl'OCA 
3901 TAGGCTCCGC CCCCCTGACX5 AGCATCACAA AAATCGAOGC TCAAGTCAGA GGTGGCGAAA 
3961 CCCGACAGGA CTATAAAGAT ACCAGGCGTT TCCCCCTGGA AGCTCCCTCG TGCGCTCTCC 
4021 TGTTCC6ACC CTGCCGCrTA CCGGATACCT GTCCGCCTTT CTCCCTTCGG GAAGCXrTGGC 
4081 GCTTTCTCAA TGCTCACGCT GTAGGTATCT CAGTTCGGTG TAGGTCX3TTC GCTCCAAGCT 
4141 GGGCXGT6TG CACXSAACCCC CCGTTCAGCC CGACCGCTGC GCCTTATCCG GTAACTATCG 
4201 TCTTGAGTCC AACCCGGTAA GACACGACTT ATCGCCACTG GCAGCAGCCA CTGGTAACAG 
4261 GATXAGCAGA GCGAGGTATG TAGGCGGTGC TACAGAGTTC TTGAAGTGGT GGCCTAACTA 
4321 CGGCTACACT AGAAGGACAG TATTTGGTAT CTGCGCTCTG CTGAAGCCAG TTACCTTOGG 
4381 AAAAAGAGTT GQTAGCTCTT GATCCGGCAA ACAAACCACC GCTGGTAGCG GTGGTTTTTT 
4441 TGTTTGCAAG CAGCAGATTA CGCGCAGAAA AAAAGGATCT CAAGAAGATC CTTTGATCTT 
4501 TTCTACGGGG TCTGACGCTC AGTGGAACGA AAACTCACGT TAAQGGATTT TGGTCATGAG 
4561 ATTATCAAAA AGGATCTTCA CCTAGATCCT TTTAAATTAA AAATGAAGTT TTAAATCAAT 
4621 CTAAAGTATA TATGAGTAAA CTTGGTCTGA CAGTTACCAA TCCTTAATCA GTGAGGCACC 
4681 TATCTCAGCG ATCTGTCTAT TTCGTTCATC CATAGTTGCC TGACTCCCCG TCGTGTAGAT 
4741 AACTACGATA CGGGAGGGCT TACCATCTGG CCXZCAGTGCT GCAATGATAC CGCGAGACCC 
4801 ACGCTCACCG GCTCCAGATT TATCAGCAAT AAACCAGCCA GCCGGAAGGG CCGAGCGCAG 
4861 AAGTCGTCCT GCAACTTTAT CCGCCTCCAT CCAGTCTATT AATTGTTGCC GGGAAGCTAG 
4921 AGTAAGTAGT TCGCCAGTTA ATAGTTTGCG CAACGTTGTT GCCATTGCTA CAGGCATCGT 
49 81 GGTGTCACGC TCGTCGTTTG GTATGGCTTC ATTCAGCTCC GGTTCCCAAC GATCAAGGCG 
5041 AGTTACATGA TCCCCCATGT TGTGCAAAAA AGCGGTTAGC TCCTTCGGTC CTCCOATCGT 
5101 TGTCAGAAGT AAGTTGGCC6 CAGTGTmTC ACTCATGGTT ATGGCAGCAC TGCATAATTC 
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Figure 20 (continued): 

5161 TCTTACTGTC ATGCCATCCG TAAGATGCTT TTCTGTGACT GGTGAGTACT CAACCAAGTC 
5221 ATTCTGAGAA TAGTGTATGC GGCGACCGAG TTGCTCTTGC CCGGCGTCAA TACGGGATAA 
5281 TACCGCGCCA CATAGCAGAA CTTTAAAAGT GCTCATCATT GGAAAACGTT CTTCGGGGCG 
5341 AAAACTCTCA AGGATCTTAC CGCTGTTGAG ATCCAGTTCG ATGTAACCCA CTCGTGCACC 
5401 CAACTGATCT TCAGCATCTT TTACTTTCAC CAGCGTTTCT GGGTGAGCAA AAACAGGAAG 
5461 GCAAAATGCC GCAAAAAAGG GAATAAGGGC GACACGGAAA TGTTGAATAC TCATACTCTT 
5521 CCTTTTTCAA TATTATTGAA GCATTTATCA GGGTTATTGT CTCATGAGCG GATACATATT 
5 581 TGAATGTATT TAGAAAAA.TA AACAAATAGG GGTTCCGCGC ACATTTCCCC GAAAAGTGCC 
5641 ACCTGACGTC TAAGAAACCA TTATTATCAT GACATTAACC TATAAAAATA GGCGTATCAC 
5701 GAGGCCCTTT CGTCTCGCGC GTTTCGGTGA TGACGGT6AA AACXTTCTGAC ACATGCAGCT 
5761 CCCGGAGACG GTCACAGCTT GTCTGTAAGC GGATGCCGGG AGCAGACAAG CCCGTCAGGG 
5821 CGCGTCAGCG GGTGTTGGCG GGTGTCGGGG CTGGCTTAAC TATGCGGCAT CAGAGCASAT 
5881 TGTACTGAGA GTGCACCATA TGCGGTGTGA AATACCGCAC AGATGCGTAA GGAGAAAATA 
5941 CCGCATCAGG CGCCATTCGC CATTCAGGCT GCGCAACTGT TGGGAAGGGC GATCGGTGCG 
6001 GGCCTCTTCG CTATTACGCC AGCTGGCGAA AGGGGGATGT GCTGCAAGGC GATTAAGTTG 
6061 GGTAACGCCA GGGTTTTCCC A<StCACGAC6 TT6TAAAACG ACGGCXyUSTG CCAAGC 
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Figure 21 : Nucleic acid sequence of the known segment of the RlS/appa transeene used for 
the generation of transgenic mice (SEP JD NQ:S). 



LOCUS RlS/appa 3470 bp DNA SYN lS-APR-2000 

DEFINITION RlS/appa transgene with vector sequences removed. 

ACCESSION RlS/appa 

REFERENCE 1 (bases 1 to 3470) 

SOURCE synthetic construct. 

ORGANISM synthetic construct 
artificial sequence, 
salivary proline-rich protein, acid glucose-l-phosphatase; appA 
gene; periplasmic phosphoanhydride phosphohydrolase; artificial 
sequence; 

Goiovan, S., Forsberg, C.W., Phiilips, J. 
Unpublished. 



KEYWORDS 



ADTHORS 
JOURNAL 



DEFINITION 
ACCESSION 
VERSION 
SOURCE 

ORGANISM 

Mammalia; 

Rattus . 

REFERENCE 

AUTHORS 

TITLE 
encoding 

JOURNAL 
MEDLINE 
FEATURES 

source 



misc £eatiire 



Rat salivary proline-rich protein (RP15) gene, 

M64793 M36414 

M64793,l GI:206711 

Rat (Sprague-Dawley) liver DNA. 

Rattus norvegicus 

Eukairyota; Metazoa; Chordata; Craniata; Ven^ebrata; 

Butheria; Rodent ia; Sciurognathi ; Muridae; Murinae; 

1 (bases 1 to 1748) 
Lin,H.H. and Ann^D.K. 

Molecular characterization of rat multigene family 

proline-rich proteins 
Genomics 10, 102-113 (1991) 
91257817 

Location/ Qualifiers 
1. .1748 

/ organi sm= « Rattus norveg icus " 
/s train* "Sprague-Dawley " 
/dbj)cref ="taxon: lOlie" 
/ 1 is sue_typess liver * 
/tissue^libe^cosraid genomic library" 
1602-1810 

/functions" consensus sequence for initiation in 
higher eukaryotes * 



FEATURES 



Location/Qualifiers 



DEFINITION E. coli periplasmic phosphoanhydride phosphohydrolase <appA) 
gene, 

ACCESSION M5B70a L03370 L03371 L03372 L03373 L03374 L03375 
VERSION MS8708-1 GI: 145283 

SOURCE Escherichia coli DNA. 

ORGANISM Escherichia coli 

Bacteria; Proteobacteria; gatmaa subdivision; Enterobacteriaceae; 

Escherichia , 

REFEREMCB 1 (bases 18H..3109) 

AUTHORS Dassa,J,, MarcJc,C- and aoquet,P.L. 

TITLE The complete nucleotide sequence of the Escherichia coli cene appA 
reveals significant horoclog>' between pH 2.S acid phosphatase 
an(i glucose-l-phosphatase 
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Figure 21 (contiaucd): 



JOUXOVAL 
MEDLINE 

FEATURES 

Source 



sigjpeptide 
CDS 



J. Bacteriol. 172 (9), 5497-5500 (1990) 
90368616 



Location/Qualifiers 

1811. -3109 
/organlsm="Escherich.ia coli'* 
/db_xref = " taxon: 562 * 
1811- , 1876 
/gene=!"appA" 
1811. .3109 
/gene=''appA" 

/ s tajidard_iiame= "acid phosphatase/phytase " 
/ tranfil_table=sll 

/product="periplastnic phosphoanhydride pHosphohydrotarse* 
/protein_id="AAA72086 .1* 
/db.xref= -GI ! 145285 - 



/translation=«MKAILIPFI.SLLIPLTPQSAFAQSEPEIiKIjESVVIVSra 

TKATQLMQDNTTPDAWPTWPVKLGWLTPRGGELIAYI^^ 

GOyAIIAinaJERTRKrGEAFAAGIiAPDCAITVHTQADTSSPDPLF^ 

NVTIMI^RATOSIADFTGraiQTAFiUSLERVIOTPQSNIi^^ 

EIiKVSADN\rSLTGAVSLASHLTEIFLUX3AQCafl?EPGffG^ 

YIiLQRTPEVARSRATPIiIJ)LIKrALTPHPPQKQAYGVTL^ 

PPGEVKLTIAGCEERNAOGMCSIAGFTQrVNEHlIPACSL« 

matjpeptide 1877 3106 

/gene^'appA" 

/product="periplastaic phosphoanhydride phosphohydrolase" 

imitation replace{1817. . 1819, "gcg changed to gcc") 
/gene=*appA" 

/standard_nanie=''A3 mutant** 

/note= "created by site directed mutagenesis" 
/phenotype^t" silent mutation* 
mutation replace (3092. .3094, • ccg changed to ccc«) 
/genes "appA" 

/standard_name=** P428 mutant" 
/note=" created by site directed mutagenesis" 
/phenotype^" silent mutation ** 
nutation replace (3095. -3097, • gcg chainged to get") 
/gene=*appA" 

/standard_iiame»" A429 mutant" 
/note«"created by site directed mutagenesis" 
/phenotype«* silent mutation " 



polyA_signal 3262 . - 3457 

/note='»SV40 signals" 

BASE C»DNT 1065 a 721 c 735 g 949 t 

ORIGIN 

1 GGATCCCCTT TGCrTATGTAG TTTTTAAT(3G AAATTACAAC CC^TAGTGTG TTGATAAATA 
61 GAGAGTCCrTG "rTrGGTrTAA GCAACCTCTG TTTCTCATAA A(rrCCATAAA AACAGGAATA 
121 (rrCTTTGTTT CTTAGCMAAC CAAAAGATTT AGTGAATTGA AAACAATGTT CCCTTA<3AGT 
181 ATAGGTCTAA TAACCCC3GAA AATATTACCA TGATACTTGAG CATTTGTAAG TATCTTCATAG 
241 CATGTAGTAT CCMAGTCCA TCAATGAGAG ACACATTTAA CATGATTTTC ATTAATCAGG 
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Figure 21 (continued): 

301 TGGAAAAGAC ATGACAACAT TCACAGGCAC TGCACAGAAC ATAGTGGTCC ACCTTGCACA 
361 TATTTCACTA AACTAGGTTT ATCTATTTTG TTGCTTTCTC TAACATCTCT GCAATGAAGC 
421 AGGTCAACAG TGCCACATAT CCTTTACTTA ACCTAAGGAA CACAAAAAAT TTTCTACATA 
481 TATCCTGGTT AGAGAGTGCT TAAAATAAGT TTTCCAAGAA TGGAAAAGAA ATGTTCTGAC 
541 TTAACAATTA AGACAGTATT TATTTAAAGC AAGAAATATG AGGCACACAA GAAAATATTT 
601 TGGGAAGAAA CCATTTGGTG AACAATATTT CAAATAATIAA TAGACAAACA TAGTTAATTG 
661 TAAAACATAT GTTTGACCAG CCCTTCTTTT CAATAGGCTT AATGTGAATA AAATGTTA7UV 
721 GATTCTCTTT GGGTGGCTGC AAATTGTCCA CGAATAAGAC AAAATATAAA AATAAGGACT 
781 GAGTCTCACA AAATGAAAAG GAAATATATT CAGAAAGAGA ATCTTGAGAG AATGTGTTGT 
841 CACAAATTAA AGAAAACCTG TGGTGAATGA CATCCTGAGG CCTGAGCTAT TACTGACATT 
901 TAAGATAAAG GTAACTGTAT ACATTTGTCC CATTGAGGGG ACAAGAAAGC TGCTCTCATG 
961 TTCAGCTCTA TAATTCTTGC CTTAAACAAC TTAAATAGAA TGATTTAAAA TATGGAGCTG 
1021 TCCATGGACC TTTGAAATAT AAAATAGTCA "AGCAACTTAT CAAGGAATTA CAGATFCCTT 
1081 GATACTAACA CAGGTAAATC CCACACGTGT TTTGAGACTA CATTTGCTGG GATTTTATTG 
1141 ATGTAATAGG TCACATGTTT TTCGGGCCAA TGTTGCTGTT ATTCGGTTAC TTCAAGAGAA 
1201 TAGTGGCAAC TGATGCTATG TATTCTAGGG GTTTGAAGTG ATGTTTCATG ArTSAAATTT 
1261 GTAAAAGAAT AACATCATCA TTCTTAACAA TAGAACATAT AAAGTCACAC AGAAGTGACA 
1321 GTGTTTAAGC TGTACTATTG ATCAAAGAAA TTTATTACCT TCAGTTTCAA TGGAAATAAT 
1381 TACTGATAAT ACAAACATGT GTGAACACAC ACTAATCCTA TCCAAATGCA CAGTGATACA 
1441 CAGAAAATAT TAGCAAGTAG AATGCAATAT TTATATAACG ATTGTATTTA TCAATCAATT 
1501 GTATGTATCA ATATATGGGC TATTTrCTTA CACATGATTT TATTCAAATT TACTCTAATC 
1561 ATTGTTGAAC CATTTAGAAA AGGCATACTG GCAACTTTTC CTTACCTCAT CCAGCTGGGC 
1621 AAAAGTCCCA GTCTGGAGTA AAGGATGCAA GATTTCCTGC TCTGTTAAGT ATAAAATAAT 
1681 AGTATGAATT CAAAGGTGCC ATTCTXCTGC TTCTAGTTAT AAAGGCAGTG CTTGCTTCTT 
1741 CCAGCACAGA TCTGGATCTC GAGGAGCTTG GCX5AGATTTT CAGGAGCTAA GGAAGCTAAA 
1801 AGCCGCCACC ATGAAAGCCA TCTTAATCCC ATTTTTATCT CTTCTGATTC CXfTTAACCCC 
1861 GCAATCTGCA TTCGCTCAGA GTGAGCCGGA GCTGAAGCTG GAAAGTGTGG TGATTGTCAG. 
1921 TCGTCATGGT GTGCGTGCTC CAACCAAGGC CACGCAACTG ATGCAGGATG TCACCCCAGA 
1981 CGCATGGCCA ACCTGGCCGG TAAAACTGGG TTGGCTGACA CCGCGCX5GTG GTGAGCTAAT 
2041 CGCCTATCTC GGACATTACC AACGCCAGCG TCTGGTAGCC GACGGATTGC TGGCGAAAAA 
2101 GGGCTGCCCG CAGTCTGGTC AGGTCGCGAT TATTGCTGAT GTCGACGAGC GTACCCGTAA 
2161 AACAGGCGAA GCCTTCX3CCG CCGGGCTGGC ACCTGACTGT GCAATAACCG TACATACCCA 
2221 GGCAGAIACG TCCAGTCCCG ATCC6TTATT TAATCCTCTA AAAACTGGCG TTTGCCAACT 
2281 GGAJAAOGCG AAGGXGACTG AjOGCGATCXTT CAGCAGGGCA GGAGGGTCAA TTGCTGACTT 
2341 TACCGGGCAT CGGCAAACGG CGTTTCGCX^A ACTGGAACGG 6T6CTTAATT TTCCGCAATC 
2401 AAACTTGTGC CTTAAACGTG AGAAACAGGA CGAAAGCTGT TCATTAACGC AGGCATTACC 
2461 ATCGGAACTC AAGGTGAGCG CCGACAATGT CTCATTAACC GGTGCGGTAA GCCTCX5CATC 
2521 AATGCTGACG GAGATATTTC TCCTGCAACA AGCACAGGGA ATGCCGGAGC CGGGGTGGGG 
2581 AAGGATCACC GATTCACACC AGTGGAACAC CTTGCTAAGT TTGCATAACG CGCAATTTTA 
2641 TTTGCXACAA OQCACOCCAG AGGTTGCCCXS CAGCCGCGCC ACCCCGTEAT TAGATTTGAT 
2701 CAAGACAGCG TTGAOGCXZCC ATCCACCX5CA AAAACAGGCX5 TATGGTGTGA CATTACCCAC 
2761 TTCAGTGCT6 TTTATCGCCG GACACGATAC TAATCTGGCA AATCTCGGCG GCGCACTGGA 
2821 GCTCAACTGG ACGCTTCCCX^ GTCAGCCGGA TAACACGCCG CCAGGTGGTG AACTGGTGOrr 
2881 TGAAC5GCTGG CXTTCGGCTAA GCGATAACAG CCAGTGGATT CAGGTTTOGC TGGTCTTCCA 
2941 GACTTTAC3VG CAGATGCGTG ATAAAACGCC GCTGTOITTA AATACGCCX3C CCXHOAGAGGT 
3001 6AAACTGACC CTGGCAGGAT GTGAAGA606 AAATGGGGAG GGCATGTGTT OGTTGGCAGG 
3061 TTTTACGCAA ATCGTGAATG AAGCAGGCAT ACCCGCTT6C AGTTTGTAAG GTATAAGGCA 
3121 GTTATTGGTG CCCTTAAAC3G CCTGGTGCTA CGCCTGAATA AGTGATAAXA AGCGGATGAA 
3181 TGGCAGAAAT TCGCCGGATC TTTGTGAAGG AACCTTACTT CTGTGGTGTG ACATAATTGG 
3241 ACAAACTACC TACAGAGATT TAAAAAACCT CCCACACCTC CCCCTGAAGC TGAAACATAA 
3301 AATGAATGCA ATTGTTGTTG TTAACTTGTT TATTGCAGCT TATAATGGTT ACATlATAAAG 
3361 CAATAGCATC ACAAATTTCA CAAATAAAGC ATTTTTTTCA CTGCATTCTA GTTGTGGTTT 
3421 GTCCAAACTC ATCAATGTAT CTTATCATGT CTGGATCGAT CCCCGGGTAC 
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Figgre 22: Nucleic acid sequence of the SV40/APPA+iiitron plasmid fSEO ID NQ:6V 

LOCUS SV40/APPA 5421 bp DNA CIRCUIAR SYN 14-APR-2000 

DEFINITION Ligation of SV40 promoter/enhancer into CAT/APPA+intron 

ACCESSION SV40/APPA 

REFERENCE 1 (bases 1 to 5421) 

SOURCE synthetic construct, 

ORGANISM synthetic construct 
artificial sequence. 
KEYWORDS SV40 promoter/ enhancer , acid glucose-l-phosphatase; appA gene; 
periplasmic phosphoanhydride phosphohydrolase ; artificial 
sequence ? 

AUTHORS Golovan, Forsberg, C.W., Phillips, J. 

JOURNAL Unpublished 

DEFINITION E. coli periplasmic phosphoanhydride phosphohydrolase (appA) 
gene, 

ACCESSION M58708 L03370 L03371 L03372 L03373 L03374 L03375 
VERSION M58708.1 GI:14S283 

SOURCE Escherichia coli TOA. 

ORGANISM Escherichia coli 

Bacteria; Proteobacteria ; gamma subdivision; Enterobacteriaceae ; 

Escherichia . 



REFERENCE 1 (bases 40 1337) 

AUTHORS Das6a,J. f Marck^C. and Boquet^P.L. 

TITLE The complete nucleotide sequence of the Escherichia coli gene appA 
reveals significant homology between pH 2.5 acid phosphatase 
and glucose* 1 -phosphatase 

JOURNAL J. Bacteriol. 172 (9), S497-S500 (1990) 

KEDLINE 90368616 



FEATURES Location/Qualifiers 
Source 40 1337 

/organism^" Escherichia coli" 
/db_xref = " taxon t S62 " 

sxg_peptide 40, . 105 

/gene="appA" 
CDS 40 1337 

/gene="appA" 
/6tandard_nanie«"acid phosphatase/phytase" 

/ tran6l_table«ll 

/products "periplasmic phosphoanhydride phosphohydrolase" 

/protein_ide " AAA720 86 . 1 « 

/db_xrefo»GItl4S285" 

/translation«"I«CAILIPFLSLLIPLTPQSAFAQSEPBLKLESVVTVSRHGVRAP 

TKATQI^IQIOTPDAWPTWPVKIXS^TPRGGELIAYLG^ 

GQVAIIADVDERTRKTGEAFAAGLAPDCAITVHTQADTSSPDPLEWPIiKTGVCQU^ 

MVTDAILSRAGGSIADFTGHRCn^iFRELERVIOTPQSNIiCLiaiETO 

£LKVSADN\^LTGAVSLASMLTEIFIiLQQAQ(^EPGWaRITDSHQWNTLLSL^^ 

YIJjQRTPEVARSRATPLLDLIKTALTPHPPQKQAYGVTLPTSVLFIAGHDTNLAH^ 

AL£LNVniiPGQPDllTPPGGELVFERWRRLSDNSQWIQVSLVFQTIX)WRDKTP^ 

PPCHVKLTLAGCEEBNACyaiCSLAGFTQIVNBARIPACSL** 

matjpeptide 106 1334 

/genes^appA" 
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Figure 22 (contmucdV, 



mutation 



mutation 



mutatxon 



/products "periplasmic phosphoanhydride phosphohydrolase" 

replace(46.- 48,"gcg changed to gcc") 
/gene=*'appA*' 

/standard_name="A3 mutant" 

/note=* created by site directed mutagenesis** 
/phenotype=" silent mutation" 
replace(1320. .1322," ccg changed to ccc-) 
/gene="appA" 

/standard__name=" P428 mutant" 
/note="created by site directed mutagenesis" 
/phenotypea" silent mutation " 
replace (1323- .1325," gcg changed to gcf) 
/genes " appA" 

/standard__name=" M29 mutant" 
/note«"created by site directed mutagenesis" 
/phenotypess" silent mutation " 



DEFIKITION Plasmid pBLCAT3 (bases 2200 to 4924) 
ACCESSION X64409 
VERSION X64409,l GI: 58163 

SOURCE synthetic construct. 

ORGANISM synthetic construct 
artificial sequence. 
REFERENCE 1 (bases 2200 to 4924) 
AUTHORS Luchow,B-H.R. 
TITLE Direct Submission 

JOURNAL Submitted (06-FEB-1992) B.H.R- Luckow, German Cancer Res 
Center, Ira Keuenheimer Feld 280, w^6900 Heidelberg, FRG 
REFERENCE 2 (bases 2200 to 4924) 
AUTHORS Luc)cow,B. and Schutz,G. 

TITLE CAT constructions with multiple unique restriction sites 



for 

regulatory 

JOURNAL 
MEDLINE 
COMHENT 
es^eriments 

FEATURES 

source 



the fxinctional analysis of eukaryotic promoters and 
elements 

Nucleic Acids Res. 15 (13), 5490 (1987) 
87260024 

Prorooterless CAT vector for transient transfection 

with eukaryotic cells. Allows the analysis of foreign 
promoters and enhancers. 

Location/Qualifiers 

2200 to 4924 

/organism^" synthetic construct" 
/db_xref = " taxon : 3263 0 " 



SV40 t intron 

polyA_signal 

CDS 



1380. .1993 

/note«"SV40 signals • 
1990- .2230 
/note="SV40 signals" 
con^leraent (3471. .4317) 
/codon_start=l 
/ transT_table=l 1 
/gene="Arap" 

/producte"beta-lactamase'' 
/protein_id-"CAA4S7S3 .1" 
/db_xref= "01:58165" 
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Figure 22 ('continued'): 



SV40 promoter/ enhancer 5023.. 5402 
/note="SV40 signals" 



BASE COUNT 1413 a 1321 c 1331 g 1355 t 

ORIGIN ^ ^f^.j^f^ AGGAGCTAAG GAAGCTAAAA GCCGCCACCA TGAAAGCCAT CTT AATCCCA 
61 TTTTTATCTC TTCT6ATTCC GTTAA.CCCCG CAATCTGCAT TCGCTCAGAG TGAGCCGGAG 
121 CTGAAGCTGG AAAGTGTGGT GATTSTCAGT CGTCATGGTG TGCGTGCTCC AACCAAGGCC 
181 ACGCAACTGA TGCAGGATGT CACCCCAGAC GCATGGCCAA CCTGGCCGGT AAAACTGGGT 
241 TGGCTGACAC CGCGNGGTGG TGAGCTAATC GCCTATCTCG GACATTACCA AC6CCAGCGT 
301 CTGGxAGCCG ACGGATTGCT GGCGAAAAAG C-GCTQCCCGC AGTCTGGTCA GGTCGCGATT 
3G1 ATTGCTGATG TCGACGAGCG TACCCGTAAA ACAGGCGAAG CCTTCGCCGC CGGGdGGCA 
421 CCTGACTGTG CAATAACCGT ACATACCCAG GCAGATACGT CCAGTCCCGA TCCGTTATTT 
481 AATCCTCTAA AAACTGGCGT TTGCCAACTG GATAACX3CGA ACGTGACTGA CGCGATCCTC 
541 AGCAGGGCAG GAGGGTCAAT TGCTGACTTT ACCGGGCATC GGCAAACGGC GTTTCGCGAA 
601 CTGGAACGGG TGCTTAATTT TCCX3CAATCA AACTTGT6CC TTAAACGTGA GAAACAGGAC 
561 GAAAGCTGTT CATTAACX3CA GGCATTACCA TCGGAACTCA AGGTGAGCGC CGACAATGTC 
721 TCATTAACCG GTGCX3GTAAG CCTCGCATCA ATGCTGACGG AGATATTTCT CCTGCAACaA 
781 GCAC3WK3GAA TCCCGGAGCC GGGGTGGGGA AGGATCACCG ATTCACACCA GTGGAACACC 
841 TTGCTAAGTT TGCATAACGC GCAATTTTAT TTGCTACAAC GCACGCCAGA GGTTGCCCGC 
901 AGCCGCGCCA CCCCGTTATT AGATTTGATC AAGACAGCGT TGACX3CCCCA CCACCGCAAA 
961 AACAGGCGTA TGGTGTGACA TTACCCACTT CAGTGCTCTT TATCGCCGGA CaCGATACTA 
1021 ATCTCGCAAA TCTCGGCGGC GCACTGGAGC TCAACTGGAC GCTTCCCX3GT OWSCCGGATA 
1081 ACACGCCGCC AGGTGGTGAA CTGGTGTTTG AACGCTGGCG TCGGCTAAGC GATAACAGCC 
1141 AGTGGATTCA GGTTTCGCTG GTCTTCCAGA CTTTACAGCA GATGCGTGAT AAAACG^GC 
1201 TCTCATTAAA TAC6CCGCCC GGAQAGGTGA AACTGACCCT GGCAGGATGT GAAGAGCGAA 
1261 ATGCGCAGGG CATGIGTTOG TTGGCAGGTT TTACGCAAAT OSTGAATCAA GCACGCATAC 
1321 CCGCrroCAG TTTGTAAGGC AGTTATTGGT GCCCTXAAAC GCCTGGTGCT ACXK:CTGAM 
1381 AAGTGATAAT AAGCGGATGA ATCGCAGAAA TTCG0CX3GAT CTTTGTGAAG GAACCTTACT 
1441 TCTGTGGTGT GACATAATTG GACAAACTAC CXACA6AGAT TTAAAGCTCT AAGGXAAKTA 
1501 TAAAATTTTT AAGTGTATAA TGTGTTAAAC TACTGATTCT AATrGTTTGT GTATTTIAfiA 
1561 TTCCAACCTA TGGAACTGAT GAATGGGAGC AGTGGTGGAA TGCCTTTAAT GAGGAAAACC 

X621 mri ii G crc agaagaaatg ccatctagtg atgatcaggc tactgctgac tctcaacatt 

1681 CTACTCCrCC AAAAAAQAAG AGAAA66TAG AAGACCCCAA GGACTTTCCT TCAGAMTOC 
1741 TAAGTTTrrr GAGTCATGCT GTGTTTAGXA AIAGAACTCT TGCTTGCTTT GCTATTTACA 
1801 CCACAAAGGA AAAAGCTGCA CTGCTATACA AGAAAATTRT GGAAAAATAT TCTGTAA^ 
1861 TTATAAGTAG GCATAACAGT TATAATCATA ACATACTCTT TTTTCTTACT CCACACAGGC 
1921 ATAGAGTGTC TGCTATTAAT AACTATGCTC AAAAATTGTG TACCTTTAGC TTTTTAAITT 
1981 GTAAAGGGGT TAATAAGGAA TATTTGATGT ATAGXGCCTT GACTAGAGAT CATAMOGC 
2041 CAXACCACAT TTSraGRGGT TTTACTTGCT TTAAAAAACC TCCCACACCT CCCCCTGAAC 
2101 CIGAAACAIA AAATCRMGC AAIXGTIGTT GTTAACTTGT TT ATTGCAGC TTATAATGGT 
2161 TACAAAIAAA GCAATM3CAT CACAAATTTC ACAAATAAAG CATTTTTTTC ACTGCATTCT 
2221 AGTTGIGGTT TGTCCAAACT C3^TCAATCTA TCTTATCATG TCTGGKTCGA TOCCOTG^ 
2281 COGAGCTCGA ATTCQXAATC ATGGTCATAG CTGTTTCCTG TOTGAAATTG TTATCCGCTC 
2341 ACAATTCCAC ACAACATACG AGCCGGAAGC ATAAAGTGTA AA6CCTGGGG TGCCTAATGA 
2401 GTCAGCIAAC TCACaiTAAT TOCGTTGCGC TCACTGCCCG CTTTCCAGTC GGGAAACCTG 
2461 TCGTCCCAGC TGCATTAATO AATCOGCCAA OGOSCGGGGA GAGGCGGTTT GOGTATTOGG 
2521 CGCTCTTCCG CTTCCTCGCT CACTGACTOS CTGCGCT0G6 TCJQTTCGGCT GC6GCGMCG 
2S81 CTATCAGCTC ACTCAAAGGC GOTAATACGG TTATCCACAG AATCAGGGGA TAACGCRG^ 
2641 AAGAACATCT GAGCAAAAGG CCAGCAAAAG GCCAGGAACC GTAAAAAGGC CGCGTTGCTG 
2701 GCGTTTTrCC ATAGGCTCCG CCCCCCTGAC GAGCATCACA AAAATCX3ACG CJ^^M^G 
2761 AGGTGGCGAA ACCC6ACAGG ACTATAAAGA TACCAGGCGT TTCCCCCTGG AAGCTCCCTC 
2821 GTGCGCTCrC CTGTTCCGAC CCTGCOGCTT ACCGGATACC TCTCCGCCTT TCTTCCTICG 
2881 GGAAGCGTGG CGCTTTCTCA ATOCTCACGC TCTAGGTATC TCAGTTCGGT 6TAGCTCGTT 
2941 CGCrCCAAQC TGGGCTGTCT GCACGAACCC CCCGTTCAGC CCGACCGCTG CGCCTTATCC 
3001 GGTAACTATC GTCTTGAtSTC CAACCCGGTA AGACACGACT TATCGCCACT GOCAGCaGCC 
3061 ACTGGTAACA GGATTAGCAG AGCGAGGTAT GTAGGCXJOTG CTACAGAGTT CTTGAAGTGG 
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Figure 22 (coatinued): 

3121 TGGCCTAACT ACGGCTACAC TAGAAGGACA GTATTTGGTA TCTGCGCTGT GCTGAAGCXZA 
31B1 GTTACCTTCG GAAAAAGAGT TGGTAGCTCT TGATCCGGCA AACAAACCAC CGCTGGTAGC 
3241 GGTGGTTTTT TTGTTTGCAA GCAGCAGATT ACGCGCAGAA AAAAAGGATC TCAAGAAGAT 
33 01 CCTTTGATCT TTTCTACGGG GTCTGACGCT CAGTGGAACG AAAACTCACG TTAAGGGATT 
3361 TTGGTCATGA GATTATCAAA AAGGATCTTC ACCTAGATCC TTTTAAATTA AAAATGAAGT 
3421 TTTAAATCAA TCTAAAGTAT ATATGAGTAA ACTTGGTCTG ACAGTTACCA ATGCTTAATC 
3481 AGTGAGGCAC CTATCTCAGC GATCTGTCTA TTTCGTTCAT CCATAGTTGC CTGACTCCCC 
3541 GTCX3TGTAGA TAACTACGAT ACGGGAGGGC TTACCATCTG GCCCCAGTGC TGCAATGATA 
3601 CCGCX3AGACC CACGCTCACC GGCTCCAGAT TTATCAGCAA TAAACCAGCC AGCCGGAAGG 
3661 GCCGAGCGCA GAAGTGGTCC TGCAACTTTA TCCGCCTCCA TCCAGTCTAT TAATTGTTGC 
3721 CGGGAAGCTA GAGTAAGTAG TTCGCCAGTT AATAGTTTGC GCAACXSTTGT TGCCATTGCT 
37 81 ACAGGCAxCG TGGTGTCACG CTCGTCGTTT GGTATGGCTT OVTTCAGCTC CQQTTCCCAA 
3841 CGATCAAGGC GAGTTACATG ATCCCCCATG TTGTGCAAAA AAGCGGtTAG CTCCTTCGGT 
3901 CCTCCGATCG TTGTCAGAAG TAAGTTGGCC GCAGTGTTAT CACTCATGGT TATGGCAGCA 
3961 CTGCATAATT CTCTTACTGT CATGCCATCC GTAAGATGCT TTTCTGTGAC TGGTGAGTAC 
4021 TCAACCAAGT CATTCTGAGA ATAGTGTATG CGGCGACCGA GTTGCTCTTG CCCGGCGTCA 
4081 ATACGGGATA ATACCGCGCC ACATAGCAGA ACTTTAAAAG TGCTCATCAT TGGAAAACX3T 
4141 TCTTCGGGGC GAAAACTCTC AAGGATCTTA CCGCTGTTGA GATCCAGTTC GATGTAACCC 
4201 ACTCGTGCAC CCAACTGATC TTCAGCATCT TTTACTTTCA CCAGCGTTTC TGGGTGAGCA 
4261 AAAACAGGAA GGCAAAATGC CGCAAAAAAG GGAATAAGGG CGACACGGAA ATGTT6AATA 
4321 CTCATACTCT TCCTTTTTCA ATATTATTGA AGCATTTATC AGGGTXATTG TCTCATGAGC 
4381 GGATACATAT TTGAATGTAT TTAGAAAAAT AAACAAATAG GGGTTCCGCG CACATTTCCC 
4441 CGAAAAGTGC CACCTCACGT CTAAGAAACC ATTATTATCA TGACATTAAC CTAIAAAAAT 
4S01 AGGCGTATCA CGAGGCCCTT TCGTCTC3GC6 CGTTTCGGTG ATGACJGGTGA AAACCTCTGA 
4561 CACATGCAGC TCCCGGAGAC GGTCACAGCT TGTCTGTAAG CGGATGCCGG GAGCAGACAA 
4621 GCCCGTCAGG GCGCGTCAGC GGGTGTTGGC GGGTGTCGGG GCTGGCTTAA CTMGOGGCA 
46 Bl TCAGAGCAGA TTGTACTGAG AGTGCACCAT ATGCGGTGTG AAATACCGCA CAGATGCGXA 
4741 AGGAGAAAAT ACCX3CATCAG GCGCCATTCG crATTCAGGC TGCGCAACTG TTGGGAAGGG 
4801 CGATCGGTGC GGGCXTTCTTC GCTATTACGC CAGCTGGCGA AAGGGGGATG TGCTGCAAGG 
4861 CGATTAAGTT GGGTAACGCC AGGGTTTTCC CAGTCACGAC GTTGTAAAAC GACGGCCAGT 
4921 GCCAAGCTTT ACACTTTATG CTTCCGGCTC GTATGTTGTG TGGAATTGT6 AGCGGATAAC 
4981 AATTTCACAC AGGAAACAGC TATGACCATG ATTACGAATT CGGCGCAGCA CCATOQCXTTG 
5041 AAATAACCTC TCAAAGAGGA ACTTGGTTAG GTACCTTCTG AGGCGGAAAG AACCAGCTGT 
5101 6GAATX3TGTG TCAGTTAGGG TGTGGAAAGT CCCCAGGCTC CCCAGCAGGC AGAAGTATGC 
5161 AAAGCATGCA TCTCAATTAG TCAGCAACCA GGTGTGGAAA GTCXZCCAGGC TCCCCAGCAG 
5221 6CAGAAGTAT GCAAAGCATG CATCTCAATT AGTCAGCAAC CATAGTCXTCG CCCXTTAACTC 
5281 CGCCCATCCC GCCCCTAACT CCX3CCCAGTT CCGCXTCMrrC TCCGCCCCAT GGCTGACTAA 
5341 TTTTTTTTAT TTATGCAGAG GCCGAGGCCG CCTCGGCCTC TGAGCTATTC CAGAAGTAGT 
5401 GA6GA0GCTC GAGGAGCTTG G 
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Figure 23, The nucleic acid sequence of the Lama2/APPA transgeneused for the generation 
of transgenic mice and transeenic pigs (SEP ID NO: 7) 



LOCUS transgene 17732 bp DNA SYN 14-APR-2 000 

DEFINITION Lama-appA cut Xhol., 20623 to Notl., 17732 
ACCESSION transgene 

KEYWORDS parotid secretory protein; acid glucose-l-phosphatase; appA 
gene; 

periplasmic phosphoanhydride phosphohydrolase; artificial 
sequence; 
cloning vector 
REFERENCE 1 (bases 1 to 17732) 

AUTHORS Golovan, S., Forsberg, C.W., Phillips, J. 
JOURNAL Unpubi i shed . 



FEATURES 

DEFINITION M. musculus Psp gene for parotid secretory protein- 
ACCESSION X68699 
VERSION X6Z699A GI:53809 
SOURCE house mouse. 

ORGANISM Mus musculus 

Bukaryota; Metazoa; Chordata; Craniata; Vertebrata; Hammalia; 

Eutheria; Rodent ia; Sciurognathi ; Muridae; Murinae; Mus. 

REFERENCE 1 (bases 3777 to 5332;) 

AUTHORS Svendsen,P-, Laursen.J., Krogh-Pedersen,H. and Hjorth, J.P. 
TITLE Novel salivary gland specific binding elements located in 

the PSP proximal enhancer core 
JOURNAL Nucleic Acids Res- 26 (11), 2761-2770 (1998) 
MEDLINE 98256451 

REFERENCE 2 (bases 7147 to 12653; 13952 to 17731) 
AUTHORS Mikkelsen.T.R. 

TITLE Direct Submission 

JOURNAL Submitted (07-0CT-1992 ) T.R. Mikkelsen, Department of 

Molecular Biology, University of Aarhus, CP Hollers Alle 
130, 8000 Aarhus, DENMARK 

ISEFERENCE 3 (bases 7147 to 126S3; 13952 to 17731) 

AUTHORS Laursen J, Hjoz:t:h JF 

TITLE A cassette for high-level expression in the mouse salivary 
glands . 

JOURNAL Gene 1997 Oct 1;198 (1-2) :367-72 
MEDLINE 9370303 



FEATURES Location/Qualifiers 

source 1-to 12653; 13952 to 17731 

/organisms "Mus musculus** 
/st:xain= "C3H/AS " 
/db_xref e « taxon : 1 0 0 9 0 " 
/chromosome="2 " 

/raap="Estimate: 69 cM from centromere" 
/clone«''Lambda YPl, Lambda YP3, Lambda YP7" 
/clone_lib«"Lainbda-PHAGE (Lambda L47,l)'' 
/germline 
/note«"Allele: b" 

misc_feature 3777*5332 
/genes "PSP" 

/function^" salivary gland specific positive acting 
regulatory region" 
enhancer 7147.. 8724 
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Figure 23 (continued^ 

/ evidence=exper imental 
exon 1177B.. 11824 

/gene="Psp" 
/note="exon a" 
/nuniber=l 

/evidence=experiinental 
exon 12626.. 14190 

/note="exon b fused with exons h and i" 
misc^feature 12644-12652 

/functions" consensus sequence for initiation in higher 

eukar'/otes * 
misc_feature 13952-13965 

/function=" M13mplB polylinker" 



DEFINITION E. coli periplasndc phosphoanhydride phosphohydrolase (appA) 
gene, 

ACCESSION M58708 1*03370 L03371 L03372 L03373 Ii03374 L03375 

VERSION M58708*l GI:145283 

SOURCE Escherichia coli DNA. 

ORGAOTSM Escherichia coli 

Bacteria; Proteobacteria; gamma subdivision; 
Enterobacteriaceae ? 

Escherichia. 



REFERENCE 1 (bases 12653 . .13951) 



AUTHORS 
TITLE 



JOURNAL 
MEDLINE 



Das8a,Jw Marck,C. and Boquet,P.L- 
The complete nucleotide sequence of the Escherichia coli 
gene appA reveals significant homology between pH 2.5 
acid phosphatase and glucose-1 -phosphatase 
J. Bacteriol. 172 (9), 5497-5500 (1990) 
90368616 



FEATURES 

Source 



sigjeptide 

/gene=*appA" 

CDS 



Location/Qualifiers 

12653, -13951 
/orgaLnisni="Escherichia coli" 
/db_xr e f = " t axon : 5 62 " 
12653.-12718 

12653 13951 
/genes* i^pA* 

/standardjaame«''acid phosphatase /phytase* 
/transl_table«ll 

/products "periplasmic phosphoanhydride 
phosphohydrolase " 
/protein_id= " AAA7 2086.1" 
/dbjjcref -•'GI : 145285 - 



/translation«"MKAILIPFLSLLIPLTPQSAFAQSEPELKLESWIVSRHGVRAP 
TKATQLMQDVTPDAWPTWPVKLGWLTPRGGELIAYLGHYQRQRLVADOLLAI^ 
GOVAIIADVDERTRKTGEAFAAGLAPDCAITVHTQADTSSPDPLPNPLRTGVC^ 
imDAILSRAGGSIADFTGHRQTAFRELERVLNFPQSNLCLlCREKQDESCSLTQALPS 
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Figure 23 (continued): 

ELKVSMWSLTGAVSIiASMLTEIFLIXSQAQGM 

YLIiQRTPEVARSRATPLLDLIKTALTPHPPQKQAYGVTLPTSVLFIAGHDT^^ 

ALEIiin^TLPGQPDNTPPGGELVFERWRRLSDNSQWIQVSLVFQTLQQMRD 

PPGEVKLTLAGCEERNAQGMCSIiAGFTQIVNEARIPACSL" 
mat_peptide 12719 13948 

/gene="appA" 

/product = "periplasmic phosphoanhydride 
phospbohydrolase " 

imitation replace (12659. . 12661, "gcg changed to gcc") 
/genes^appA" 

/standard_name="A3 mutant" 

/note=" created by site directed mutagenesis 
/citation^ [3] 

/phenotype=" silent mutation" 
mutation replace (13934. •13936, " ccg changed to ccc") 
/gene«"appA" 

/standard_name»*» P428 mutant" 

/note==" created by site directed mutagenesis" 

/citation= [3] 

/phenotype=" silent mutation " 
mutation replace (13937. .13939, « gcg changed to get") 
/gene»"appA" 

/standardjaame=* A429 mutant" 
/note«"created by site directed mutagenesis" 
/citation^ 13] 

/phenotype=" silent mutation " 



BASE COmn 4719 a 4125 c 4168 g 4719 t 

ORXOIN^ 

1 TCGAGAGTAT CTTTGTCAGC TGTGCCTCCA ACARAGGGCST ACTGTTGCCC ACATAGAAAG 
€1 ATCTAAACTA ATTAATTAAT CCCTCACCCG CAAATCTTTC AGTCACTAAG TTRGCRCGAT 
121 TGTTGAACAA. GTTCTCCAAA GGAGAfiATAC AOATGAGTGC GTATAGGGTG GACCTOaCTG 
181 CXORGGAGAC ACCTGCATCT GACTAAGAAQ AGCCACGQTO TTAGTTGAAT GGTGTGGAGT 
241 JW3GGTGGTTC TGTOGGACAG TAGARAATCG AGAGGCATGT GCCX3TTTAGT QAACTGATGG 
301 AAGCTACCCX: ftAAC3QACAfiR. OATTGTCRGT CAGGCCAATC CGTTTCGAGT TTGATGGGCA 
361 GCCX3GACRGT GAGACAGACA CM3CXACTCA GTTG6RGGAA GGATOAGAAC AATGGC PAGC 
421 AGGGRTTGAG AGACCCTGAC AGGCX3CAAGG CCCTAACACA CRCACXrCACC ACCTCRCITG 
481 ACaAAGCXGC CAAAGACCAA AGACTTGTrC TCCAirAGAA ATOACAGCTG GCTTGMC06 
541 ACAGCaTAAT AAGCAGAGTG TACTCTGATT GGAGAACTTT AATGTGTTTC ATTCAGXATT 
601 AIAAAAGOAC ABTATTACAG ATTTTCTTGT ACACTGCTGT TA CATGTGG G GCAGTOTOTC 
661 TTTAAGTAGO CTAAAGIACT CTTTAAAAAT GGGTCCTAGA TATTTTTTCC TTTAACTCAA 
721 GTCTCTTACT GTTTAAATGA TTTTTATTTT GTTTAATATG QAGGAAAAAO AAGCGTAAAT 
781 GGACAATATA TATTTAGAGA AAGATGGTTA GCTGTCAGAA AAATATGCAA JCXCAAAATCA 
841 CACCAAGACT GCAGCACACC CCTGTCAGAT GGCTGTGATC AAQAAAATAA ATQACaATGA 
901 OTQGTGGT3A AGATGTACTA AAGGGAAACA CACACAC3VCA CACACACACA CACACACACA 
961 CACa^CTGQAG CAACCACTGT GGAAATCAQT ATGAATGGTC CTCAAAAACC TGAAGATAGA 
1021 GCGGGGCX3TC GTCGCATACA CTTTTATTCC CaGCACIGGG GAGGCAGAGG CAGOTGGATC 
1081 TCTGAGTTCC AGGCCAGCCT GGTCTATAGC ACAGGTTCTA GQACA6CCAQ GGCTACACAG 
1141 AAAAACCCro CCrTGATTAA ACCAAACCAA ACCAAACCAA ACCAAACCAA ACCAAACCAA 
1201 ACCAAACCAA ACCAAACCAG ACCAAACCAA AACACXGAAG ATAGAACTTC AGTATTCCAT 
1261 TCCTAGATAT ATACCCAATG GAGACTAAGT CAGCAAGACA CCTOCACAGC CATGTTCACT 
1321 ACTACACTGT TCACCACAGC CAGGCTGTGG AACCAGCCTG ACT&TCCATG ATAAATGAAT 
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Figure 23 fcontumed'): 

1381 GGATAC3GTAA CTTTCAAGGT AAATCGACTC TGCTGTGTAC ATGCCTCACR TTCTCTTTAT 
1441 TCATTTrrCT TTATGAGGTC TCCATTCAGG AC3TCACATGG TAGTTCTATT TTCAGTCTTC 
1501 TGAAGATACT ACACTGGTCC CCACAGTTTA CACnTTATC AGCAGTOAAT AAGGGTTCCT 
1561 CTATCCTTAC CATCATTO3T TGTAATTTTT CTTGATGACC CTCTTTCTGA CAGGGATAGG 
1621 ATGTAATATC AGTGTGAGGA AGTACAACTT GTTTTCTAAG TA-nCTATTGG CCCCTTGCAT 
1681 rrCTTCTTTT GAAAACTCTC GGTTCCTGAC ATCTGCTCAQ OTATTCATTG GATGnGTTT 
llll CTTTgSSt TGAGTTCrrA TGAATTCTAG ATCTTAAATC CCTOCCTGTG GTTCTCTCCC 
1801 ATTCTGTAGG CTGCCTCCTC ACCCTQGCAA TTGTTGTCCT TGTTTTGCAG ARACTTT^ 
1861 CTTCATGGAA TCTCATTTCT CAGTTTTCCC TCCTCTGCTA TAGCCTGAGC TAATGCACTG 
1921 GTTTTTACAG AGCCCTGGTC TATGCCTTTA TCCTCCTCTG GCAGCTTCGG AGTTTCATTT 
1981 CTTACATTTA QATCTTTCAT CCACTTTCAA CAAGTTTTGG AGCAGGGT6A GAGATACGAA 
2041 TCTAGTTCCA TTCTTCCATA TOTGATCCTA GTTTACATAG CATCGTTOGT TGAi«iA«yrvL 
2101 TTATTTTATT TTTAAATAAT GTQTCATAAA AAACGAGGTG GTrOTAGCAG TGTOaMTTG 

2161 ■rrrcrrTCTC ctttgatcta caggtcttgt tttgtgtcag tctcatqatg TmATTGcr 

2221 ATGGCTCTGT CATACAGTCT GAGGTCAGGT ATTGTGATAT ACCTTCftGTA TTGCTCCCTC 
2281 ^CTCAQGT TTGCTTTGGC CAGGAGTCAT CTTACTCAGT GCTCTTAGAG CTCCCCCAGC 
2341 ATCTAGCTOC TACTATTCTT AGTTCAIAAA TCAGGAAACT GGGGCTCAGA GA^TTAACT 
2«1 GtStSS ACTTCTGGGO AGGT^^ 

2461 TGCTCCAGTA GCTGTCGGGT GCTCGGCTAC AGCAAAGCAC CXATACIATA ^tMTACTOG 
2521 GAGGTOGAAA AACTCAGCCT CCCTTGGGQT TCCCAAGCTC C(^aTOTCC *fiJCACTGCT 
2S81 GGAAACCrCA TGGAGTCTCA AAGGAAGGGT TGAGGGTACA TGGGGCAGOQ ^^0«ftOCC 
2641 TGGGGCTGGG ATCTCCCAAA CACCTGGATA TCCAGATGCC ACTGGGTCAG OGGQAGTTCQ 
2701 GAACAQAQTT GGGMGTCCA TGGACCTGTG ACAAGGCCAG GGCCAGGGGG AGGATAACTC 
2761 tg^ttSct AATPTGCQAA AGTCCITAGC TTASCAGCAG TTGTCTGGGA GCACAGAGGG 

'I'i SSSSS J^oqSSJ gcagtgoxk: tctgiaggcg arggtcttct ccatgttc^ 

2881 CATCGTGGTT CTTCA-TCAAA GAGACAGTCC TTGGCTCCAA ACTGGTTXAT TG MTOTICA 
2941 TTGTGGAAAA TGGGTGCACA CCACCTTCTC AGQGTGOACC AGROMCTMl «fCTTTroC. 
3001 AGGGAGGAAT ATCTGGGAAG GGACGCTTAC TGGCTAAACC CTCAGGGCCT CTA^J^ 
3061 CATXAGCATO GAGAACTCTG TTCIGGGCTA CATGACCaCA GGCCACATTT CCACAAQCCa. 

Mil Sgtggcaca tgttctaggc caggaatctg gtagggagcg tggagccacc 

'Si StggotgS tgggtoccag ggaccctgaa cccgctcaac cttaccaagt 

3241 tiStogcaq ggtccactgt cctacacaqa agctogagga qgtgtgaggg ttgtgtcttt 

3301 SgSmGTC CCAIGCTGCT TGGGGCTCRfi TTTCTOCAOC TGTACCTCAT TGGTTTGQGT 
3361 iSSS^ GGGATACTTT ATTMTCTCT GACTOOCTCC XGMMAAAA 
3421 AGTCCAGGAA CCACACCCTG AGGTTCCTGC ACTCAAGGGA CTCCCIARGT CTCTGGAGTC 
3481 TCTCCOCTTC ACaGAGCIGC CAAAGTCTAG GTTCTTTTGA GGATAACAGA GCCA3X3CTTG 

Isll SScSac ^GCATTT gtttactcaa CCTTCTTTTG tcagctccct cttcataaac 
leol A^CCTGQ cttgaggaag acttcxaaag ccaqacaact gtgcaaggaa 

3^6^ SJSS SSSS agttagcctc gatgtagccc tcaaagtctc cagagaccag 

3721 CcSgSgGC TCAAGTGGAG GGCAAGACCT GCftGCRGCCA AGCATCTQGC AGGAfiAGGAT 
378i ^ACCAT GACACACATT CTTCCTOCAG GTCACACTTA ATJGGCCATT 

3841 S jS a XXI G G ATCXATCATG GTGTTCT3TG CGAGATTAAT 6AGGTCTTAT GCTGCGAACA 
SJ^JS S^CAAG TCCCCCCCCC TT^TCACTGC TGCTAA^^^ ^^^^ 
3961 TTGICrCRAG TOTCICTCTA ATCAGRAACA AIAAAGGTCT CCTTGGATTC AAGCCCT^ 
4021 GTT^C^ TCCTTGCTOA GCCTTQQACA CCCSTACaAA CCTCCIGGM GCmO^ 
40B1 TGGGCAGAGA CTCCAAGGTG GGGAGAGACT GAlGGTftCRA AAGCAAAATA OTraXTTGGG 
nil SS^ScS CTCCTCTGCC TGTGTtSGTTC CTGCAGTCAG TCCTG«^ J^SSS? 
4201 TGQGTCTTCC ATGGGCAACA CGCAGflGGGA GGCAATGGAT GGGAATACCC ACaCCCTGOT 
^ScH SSgCT CTClXSCrCTT CATCCCTCCT CXQCCCTCTG CCACX3GCTTT 
J^SS ATCATATCTT CATATIGGCC CACAGGXGTT ^C^^ ^^^J 
4381 6TTTACTTCA GAGTOACCTT AGCAGGGCTG GTGGOAATGA GTTCTAGAAG GCTC^G^ 

J441 Sgctaggga agaaacgtct tctaactact gaggttacxa asttcctggt gg^otctct 
4501 gcctttccct tgtiaaagtc accttgaagt tagtgcaqaa gaaatcaoag ccogtcaca 
llei GAGtAAAlM ggtcctcaag atttccttto agtgcccaga *^cmgaca tttca^ 

4621 CCTCTTXOTA CCTTAAGTCA TTTGGGGTTG TATCTTCTGC TTGATGTATG TGTGTGTGTT 
«11 SJSS ACMAAGftGG rOCTCU^ SSSS 

4741 bxGGCATGTC ACATCCTCAG GCCTXXJCTCT GGTGCCaVGGA GOAACTQATG CA^^AAGAG 
4801 TAAGAGGTCA TrXCCTGGAG GCTGTCACTA TAGAGGAGAT CTTAOftGTGC ATTCCCTCCT 
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Figure 23 fcontinuedV 

4861 CCAGGCCCTG CCTGAGGATA GACATGTGCT GACTGCAACT GAAACAOa^GG CTTGGQkTGG 
4921 AGAGTTAGGT TCACAGAAGG GAGGGTGGGA GATGQATGCT TGCTG GGTT C TGGGT CTCAT 
4961 CACCAGCTCC TGACCACCCG GTCAGCCCAT GTGCTTATTC CATAGCTTTC TTTTOCTATG 
5041 TTTACTCAGT GTCGTGTTTG TTGGGACCCA GCAGAAGCCA GTCCCAGGCT GACAGCTGTG 
5101 GATACACAGG GCAGCATCAG GGTCCTCAGC CTGAAGCAGT CAGGCTGGCA GAAGAGAAAG 
5161 ACCAGCAC31C ATTCCTTCAA CXaACTATGT CTTGAAAAAC AAACATATTA TATCACATAT 
5221 ATTGCATTTA TGAGACAGCT AAftATGTACT CGGCTAGCAT GACTCCAGGT GGGGATATCT 
5281 GCAAGTGCCA TGAGTGGCAG AGGGACAGCC AATGTGAGGC AAGAAOGAAT TCTOGCTCAA 
5341 CACAGCTTAG CTCCCTGGTG TTGGTTCAAA CTTTGAGAGT 1TGACCACAA GCACTTTATT 
5401 TTTGACATAT TTAAACAGAG CACAACTTTG GGAAAAAGTT TTCTTATGAA AATTATCACyi 
5461 ATAAAGCTTA AGGCATQACT ACATTAAAAT GCCTTTGCAA AGTATATGTG CCCTCTTCCA 
5521 CAAGAATGGT TCIATTGACT GAGAAATAAT GTTCaGGATA AAQATCCAGG AAGAAAflGAT 
5581 CAGGGATAAG TAAAATACTA AACTCTTTTG CAAWBTACAT AGACCSTTCTT TCATAACSVAT 
5641 GGGTTCTATT GACTGACAAG CACTGCTCAG GAGTTGGGAA AGAGTCTAGC AIAAGCACGA 
5701 TAGCCTGGAG ACTCTAGTGA GGTCTAGTCT TACAGACS^GC AAAAATCACC AGGTTACAAA 
5761 CTACATTCAT TTCCAGTTTT CTQATCAGGC ACAGGTATGA ATCCCrTCTG TTGAAGAGAA 
5821 ARGTCCATGT GTTTAAAATA TCTGGTTTCT CCAGTGCTAT TAGCGAGAAG ACTTGAGCCC 
5881 TICEACAACTC CCACCTGGAG TGACATCCrTG TCTTCAIGGT ATATTACAIA CCTAGACACG 
5941 CTCATCTCAC AGACTTAGGA CTTTCTCTTC TGATCTCCAT TTCTGATCXX: ACTTCXaCCT 
. 6001 TTGCCTTGAT AGTCTCATTT TCTTCACTGC CTTOGTGACA ACCaTCTTAT CXrTCTGTGTA 
6061 TTTGAGTGTT ACCATTTTCA GATTTTACCT GTATGCAAGA TCACACRGTC TTTGTCTTTC 
6121 TCTCTGGATG CATGCTAATC TCTACACAAC AACCCTTCCC CGTCftCTCAG ATCTTtXTTCC 
6181 ATTAACACAT ACATGGTGCT GAAGAOGCXA GGGAGCTTCC CTTCAGTGGG GAGCTAGCTG 
6241 GCTATTGGGC CTTTTTOACT GTCCAGGAAG GCCCCTAATT GCTGftGACAA GAACTTAOAT 
6301 TCTTCATTAT 1GACTCTAAC TCATGTATCA AGCAGA&GCT AAXGAATAGT TATCAAC31GG 
6361 ATCAGAGGTT CCAGTCXAAG ACACTTTGAC KIGAAASAAC GGAGGAAGGA CAGAIGGAIG 
6421 CATAAAAGCA GGACCACTGC CCCaGGRAGG TCCTGGAAAC TGATGCAGGG CRAAGOACRG 
6481 GTTATAAACC AAATCTTAGG GAGTCAGGAA GAGCACAGAG GftGCTCaACC AACTGACCAC 
6541 TGCTTAGGGG CTACCAACCC AATCCTCCCT GTGGGAACAO CTAAGCTATC AGCCARGGGT 
6601 AAIAAACAGG CAGGACCTGT GGATGACATG GAGAGCATAG GGACCCTGGG TCCAGCCTTT 
6661 RGCACCTGCA CTCTOVSGAT ACTCCACCAT TGTCTCTTAG AGAGCCTAGG GATACTGGGT 
6721 CCAGCCTTTG GTACCTTCAC TCTCftGGGTA CCCCATCACT GTGTCTTGGA GAGCCTAGGC 
6781 ACCCTGGQTC CAGCCTTCAG TACCTGCGCT CTCAQGACAC CCCACCATTG TCTCTTGCCC 
6841 aJltrXCilCT TCCTCTTCCT CCCTTTCKIT CTCTCTTCTC TGTTTCriTC TTGACTCTCC 
6901 TTTCCCCTCA CRCCCTCACT CTRGTTCTCC CCTTCCCTCT CIGCATCACC Cl ATTCTCTC 
6961 TOTGOTCCCT CCACTTTCCT TrATCTCTCA TGCTTCTCTC CTCCCTCAAA TACrTSTCaC 
7021 CCACXATACT TCAGGGGCCA GCTCTAGTGA CAAAGCTGTT AATAGCAAGA CTCTCAGATC 
7081 TCCAACGGCT CAGAGGAGCC AGACCXaVCCA AGAACTCTCT CCAGG-TCCAA TTTCAGGTTC 
7141 CTTCGAAAGC TTTCAGCAAA TCCTCAGOaA ACATGCCACT AACAAGAAGA TGCAAATTCC 
7201 AGTTGAGAGT GGGAAAGGCC CTTGCGTAGG TCOCaTCTTC C AGGCCA AGG TCAGAGGQGC 
7261 TCTGTGTAAT CCGGATTGAC AGGGCTCAGA ACAATGTTTT GTTTTTAAGG TTTATTTftTT 
7321 TTAGGTGXm GTGTCTrrGC TTGCATGACX: TTATGTGCAT CATGTQTGTO CAGGTTCCTG 
7381 ATGACRGTAG AGQAGGGCTT TGAATCCCTG GGGAIAGGAA GTTACAGGAA ATTKCAAGCX 
7441 flCTTTGTGGG TCTrCXAGCT TTCCCAACAG AA6TGAATGC TCTTCACCAC TGAGCCATCT 
7501 CTCTAGGCCC AftGAiOACATT GCTTIATGGA TATAATtGTG TGTGTGTGTC AACATTGAGG 
7561 AAAGG6AAAT AAARAAAAAA CTTCAGCOGC TAAfiOTTGm CRGTTTCACT AATTGCXACT 
7621 TTTAGTTGTC ATAAAATGGC AGGTGCTTCA ACATITATAT ATACRAAAAC TTCCCTOCIG 
7681 CTGGTTCAAC TGTGAGAACT GGGGTAAGTG GGXGAGTTCT CTTTTTCTGT CTCTGTCTCT 
7741 GTCTCTCTCC TTCCATTCTT TCTTAAAGGA AATAAACSOT GC3U3CTGGGT TATAGCTCAT 
7801 CAATA.TGGAA CTTACASAAG TOAAAAAAGG CATTGCCTTG GTGGGTCGTS TTACXaGCTG 
7861 ATTTTTGOTT GTCCTOCAAG OAflOTCrGGa GACXGGCTGC TCTGTCTCTG TCTGTATGAG 
7921 TCAGGGAAGT CTGGGGAGCA GATTCCCTAA CCXTCAGCCT QGCCTGGTTC CTGAOTCAAC 
7981 CCAGCCTCTC TCGTCCTAGT AGCTTTTTCC AAACAGQAAT CTOASTCGTG ACAflOQAACA 
8041 AGTACCAGCC CATTGCTTAA GTGCCAGGGT TAGTGAGGGC AGGAAGCTGC CAXAGCTGGG 
8101 ATTAGTAGTT GTATTGGATG TAGGAAGTCC TATCCTGGGA CAGCTAATCC TTAMOCTTC 
8161 ACTGGAGATT TTCAATQAfiA AATTIATCCC ACGGCCCATA TGGCCCCATC CTTTTSTCTC 
8221 CAACAGCCAA GTATTTTCCA TTAGAGOAGA CTTCCTGTAC ACTTGATGQA TGCTOOTCC 
8281 AAGGTGACTT GGGOCaOTCA GTACaSACTT QGGATSACCT CTGACASCCT AACCTCTCCC 
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Figure 23 foontinuedV 

B341 CAACAAGGGC CCTCTATGTT TGCTATGTAA TGTAATGTCA GACATTGTCA GGAGTGTCCG 
8401 CAGCACAGCC TGCCCAGTGT GAGGGCTCTC ATAGGTTTCC CACTGTCTrA TCTACACAGG 
8461 GATAACGAGG AGGTAAGCTG CAGTTCCCAG TCTCACTTCA CAGAGGAAOA GATAACCCCA 
8S21 TCCCAGGTCA TGTAGCCAGC AGTGGAAAGA ATGAGGATTT GAACTCAGGT CTTCCAAGTC 
8581 CCATTGATAG CATCTCCTCA CAAGTCCCTT GCCACCCTCA CGATGCCTTA GACACTTGCC 
8641 TGCCCTTTAT ACTAAGGAGA TGCAGGTACA AGGGGTTTAC CCATGTAGCA GCTGAGGCAG 
8701 CTGGGGATAG ATACCAGCAG CAGGCCTGAT GTCACCACTC TAACTCCAGC ATCCCCAGTC 
8761 TGTGTTCCTG GAGTGTGAAA ATCCCTACTT AACAAGATTG TGCAACAGTC CTTGGCTCTG 
8821 TGACCCATAG CTGGAAACAG GATTCTCATT GATTTGTGGA ACATGGTGGC AGCCAGCCAA 
8881 AAAGAGGGTC K3CATACAGA AGACACGTGT GGCAAGGCCA CAGCAGACTC TGACTACCTT 
8941 AGCTTACAGA ATTACAAGGT CATAATGTCC TCTGCTTTGG TCACCTCATG TTAAGGACAG 
9001 GCXCTAATGA AGATGGGGCA GAAGACTGAA GGAATGGCCA ACCAATAACT GGCCCAACTT 
9061 GAGACCCATC CTACAGGCAA GCATCAATTC CTGACACTAC TAATGATACT CTGTTATGCT 
9121 TGCAGACAGA AGCCTAGCAT AACTATCCTC CGAGAGGTCC ACCCAGCAAC TGACTGAAAC 
9181 AGAAAAAGAT ATCCACAGGC AAACAGTGGA TGGAGGTCAG GGACTATTAT GGGAGAGCTG 
9241 TGGGAAGGAT TAAAAACXTCT GAAGGGGATA GGAACCCCAC AGGAAGACCA ACAGAGTCAA 
9301 CTAAGAGACC TGTGGGAGCT CTCAGAGACT GAGCCACCAA CCAAAGAGCA TACACAGGCC 
9361 GGTCCGAGGC ACCXX3GCACX3 TGTCAAGCAG ACATGCAGCT CAGTCTCCAT GTAGGTCCTC 
9421 CAATAAGCGG TAGCCTGACT GCAGTATCCA ATCCCCAACA GGGCTGCATA GTCTGGCCTC 
9481 AGTGGGGGAG GATGCCCCTA ATCCTGCAGA GACTTGATGA GTGGAGAGCT ATCCAGGGGG 
9541 AACCCACCCT CTCTGAGAAG GGAATGGGGA TGGGGGAGGG ACTCTGTGAA GAGGGGACAA 
9601 GGACAAACAA GAACCTCAAA TAGGTCAGGC CCTAAAGGCT TGCTAAGTAG CMTG GCOCA 
9661 GCTCTGTCCT GTTCCTCAGC CCAAGGCTCA GCTCCCACCT GTTTCTGTGT TITrCTGGCT 
9721 TTTCATGGGC CTAGGACXTG GTGACCAGTT CAAACAATGG GGCCTGTGGA AGACACAATA 
9781 TACAAGACTA GGGACATTCC TGTTCTGCTG ACTATCCATA GCXrTGATGTA GGTGGAAGGA 
9841 CCCAATCACT GGATTTCTAC CCTTGCACAA CCTTGACAGC TGAGGGCCTC TCAGAAACCT 
9901 ATTTCTTCCA CTGAAAAATG AGACTCTCAA ATGAACGTCG TGACAATCAT CAGGCTTATT 
9961 AAAGAGGTGT ATCTAACCTG AATGGCAAGC AGACAGCAGG CAA ATGT CTG TATCAACCTC 
10021 TAGGAAGGAC AAGAACTGCT CACTGCTGCC CCCCAGGAGG CCATTTGCTG AAACAGCTGC 
10081 TCTCCTGCTG GTGCACAGGC CCTGCCTTCT CATTGCAGCC ACAGCCCCTT CCTGTCTGAA 
10141 CCTCCTGTCA GGTCACTGGG AAACAGATCA AGATGGAACA GGACAGCTCC TGATGGTAAA 
10201 TAAAAAACAG TGGTCATGGC TATTCAXAGG GGTTTATGCT TCTTCAGTCC ACACTGTGAA 
10261 GAGCTGTGGG CATGAACCAC AGTGTTOSAG GTAGAGTTGG GGTTCTGAAA TTCACAGTGG 
10321 GCTGAGCICA GTAAATOTGA GCTGGAQGTC ACTCGTGAGA CACACAGTCC TGCTGCTTCT 
10381 GTTCCCAATA TCCTGAGGAG ACGACACATC TACTTTGTTC AGAGOCCACA GTCTAGTTGA 
10441 CCTGAGAGTT ACXZAGTTTCT TATTTGTGTG TGTGTGTGTG TCT6TGTGTG TGTGTGTGTG 
10501 TGTTGTTCGT GTGTGAGTGC AGGTGCACAT ATGATAGCGT ACACGTTGAG GTCAGAGGAT 
10561 AACTAXC3U^ CGTTGTCCCC TCCTACTTTT CCTCGGACTC TGGAGAACAA ACATGG GTCC 
10621 TTATTCCAGG GGAGCAAGTC GCTGTTGGCT GACACATCTT GCTCACATAC ATTTTACCTA 
10681 GACAATGGAG CCTCCATCAG AGTATTACTT TAGCTCCTCA CCGATGGCAA TGCACCACCT 
10741 CTCTACCCAC AIAGGA6TTG GGTCTCCACA CACOXCACA CCCCCTTCAC CAAAACGTTT 
10801 TCAGTTACTT TATCTGGTAA AGTTCATCAG AGAATGAAGC CA GTAT TAAG AACATGG AAT 
10861 CATTTGGGAA CCTGGATCTA GCAATACCCC ACCCTAGATG GAGTTGCTGA GTTTTCACCT 
10921 CftGATTAIAA TTCCCCCCTA GCTTCTATGG TTTATTCTGA AACCAGGGGA ACTCGATTCC 
10981 TCCCTTTGGA CCACAGACAT CXrrGGCTTGT GAATTCACAT GTCATCTACT GCTAATCCAT 
11041 TCGTAGTATC TGGCTCACAG AGACAC^CTA CAGTCATGGC CAATGTCAAG GTAGQACAGA 
11101 TGTGAATCAT TCCCCCAGTC CTGCTGTTTT CATGACTAAC CCTCCTCAGC ACAGTG ACCA 
11161 TGAACCTACT TTTCCCCTCC TTTTATTTTT AGAATTGCTG GAA TTTTC TA TCTTGAGAAA 
11221 TAATAGCCTT GGGCAGCATT AAACAAAATC ATCTTAGAAAG CTGGTTTAAA ATACAGATGG 
11281 TTGAGTCAGT GAAAGAGTGA GGAATGTCAT TATTGGCCCC TCACAGAGGC TGGCTCACTC 
11341 CAGCAGAGGT GGTTOAAGCT CTTGGAC3^ GGTCAGGTGC ATAGGAAAGG TNGTCTGGGA 
11401 CACTCAGAAC CACAATTOAA CAAACAGAAC TGTTGGCTTT TrrTTTTTTA AATGAGTTCT 
11461 CAAAAAATGA CTGGCTAGCT TAGGCAAATA CTTCGAGCCA ACCCAACAGA ACATTCTTCC 
11521 ATTGATTCAT TCTCGATCTT CTTTCTAGAC AAXACTGAAC TQACCCCTTG TTGGCAGTCT 
11581 CAAGTTTGAC AACATAGGGC TTTGAACTTG GCACAAOGTC CATCACTGTC ACXICAAGCAT 
11641 CCTOGGTGAC CTTTGGGrTG GAATATCTTG GCTAACCTTA GATATTTTCT TTQGAGTATC 
11701 TTTAGAACAT CCAGGAAATA QGGCTTGATT CTCATCCTGG GACCACAATA TAAGTCACCC 
11761 TAGAATCCCA GGAGATCGTG CAQAGAAACA AGGATCTCTC TCGTGTGOW CCTTCTTCAA 
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Figure 23 (continued): 

11821 AGCAGTQAGT AGTGACTCCA CTAAACTGAG TTCCCATCTG 7U3AGTCCACA GGAGGCTTTG 
11881 GGGCAAGAAG CAGAGGGAAG GCACTGTTTG TGTTGGTAAA GTTTTGA.CTC TAACAAATTT 
11941 GAAGACATAG ATGACATTGT GTCAGACTAA CAACAACCTA GACTCATGTG GGTTCTGTTT 
12001 AGGGATCAGA TTTTATTCAT CAATGACTTG TCTTAGTGTA TAGAGAAAGG CTTCCTACTG 
12061 GAGTGTAGGC TCAATAATGA CAGAAGAGAT AGCTATTTCC CCTAGGGACT GTOCTGCTCC 
12121 AAGTTTGGTG GAGAAAGGCA GTGGGGAACC TAGATGTGCT CTCTGGGGAG GGGGTCTGAA 
12181 GCTGGCTTCA TAGAAGGTGT GAAGTTTTGC TGAAACATCT AAACAGAATT ATAGCTTAGG 
12241 AAAGTGAGCA GGCAAGGCAG GGAATGTGTT GCATATGTAT ATGTACATGA AT ATATTA TG 
12301 TTATAGATAC ACACACATTT GAACCTCATT TGCAGATGAC AGAAAATAGG TTATTTTGCC 
12361 TCTCTTAACT GCTAAGCACA ATGACTTCCA GTTCCATCCA TTTCCTGAAA TGCCACAATT 
12421 TCATTTTTCA TTGTGGCTGA ATAAAATTCC ATTGCAGACT GGGCCCTACT TCATCCACTC 
12481 CTGAGGGCAG GCATATCCCC TGGCTCCATT TCTTACCTAT TQrGAKGKGP. AGTGCAACTG 
12541 TCTTGTTGAA AGGCAAGCGT GAGAGAGGCA GGCACTAATT GTGGGTTTTT GTTTCITCTT 
12601 CCTGCTATGA CTCTCCATrr GTCAGAACCA AAGATCGATA AAAGCCGCCA CCATGAAAGC 
12661 CATCTTAATC CCATTTTTAT CTCTTCTGAT TCCGTTAACC CCGCAATCTG CATTOGCTCA 
12721 GAGTGAGCCX3 GAGCTGAAGC TGGAAAGTGT GGTGATTGTC AGTCGTCATG GTGTGOQTGC 
12781 TCCAACCAAG GCCACGCAAC TGATGCAGGA TGTCACCCCA GACGCATGGC CAACCTGGCC 
12841 GGTAAAACTG GGTTCGCTGA CACCGCGCGG TGGTGAGCTA ATCGCCTATC TCGGACATTA 
12901 CXJUICGCCAG CGTCTGGTAG CCGACGGATT GCTGGCGAAA AAGGGCTGCC CGCAGTCTGG 
12961 TCAGGTCGCG ATTATTGCTC ATGTCXSACGA GCGTACCCGT AAAACAGGCG AAGCCTTCGC 
13021 CGCCGGGCIG GCACCTGACT GTGCAATAAC CGTACATACC CAGGCAGATA CGTCCAGTCC 
13081 CGATCCGTTA TTTAATCCTC TA7UUVACTGG CGTTTGCCAA CTGGATAACG COAACGTGAC 
13141 TGACGCGATC CTCAGCAGGG CAGGAGGGTC AATTGCTGAC TTTACCGGGC ATCXSGCAAAC 
13201 GGCGTTTCGC GAACTGGAAC GGGTGCTTAA TTTTCCGCAA TCAAACTTGT GCCTTAAACX5 
13261 TGAGAAACAG GACGAAAGCT GTTCATTAAC GCAGGCATTA CC3^TCGGAAC TCAAGGTGAG 
13321 CGCXXSACAAT GTCTCATTAA CCGGTGCGGT AAGCCTCGCA TCAATGCTGA CGGAGATATT 
13381 TCTCCTGCAA CAAGCACAGG GAATGCCGGA GCCGGGGTGG GGAAGGATCA CCGATTCACA 
X3441 CCAGTGGAAC ACCTTGCTAA GTTTGCATAA CGCGCAATTT TATTTGCTAC A AOGC ACGCC 
13501 AGAGGTTGCC CGCAGCCGCG CCACCCCGTT ATTAGATTTG ATCAAGACAG CGTTGACQCC 
13561 CCATCCACCG CAAAAACAGG CGTATGGTGT GACATTACCC ACTTCAGTGC TGTTTATCGC 
13621 CXK3ACACGAT ACTAATCTGG CAAATCTCGG CGGCGCACTG GAGCTCAACT GGACGCTTCC 
13681 CGGTCAGCXX5 GATAACACGC CGCCAGGTGG TGAACTGGTG TTTG AACG CT GGCX3TCGGCT 
13741 AAGCGATAAC AGCCAGTGGA TTCAGGTTTC GCTGGTCTTC CAGACTTTAC AGCAGATGCG 
13801 TGAXAAAACG COGCTGTCAT TAAATACGCC GCCCGGAGAG GTGAAACTGA CCCTGGCAGG 
X3861 ATGTGAAGAG CGAAATGCXX: AGGGCATGTG TTCGTrGGCA GGTTTEACGC AAA3XX3TGAA 
13921 TGAAGCACGC ATACCCXSCTT GCAGTTTGTA AGGTACCCGG GGATCACAAC TT GCXrCT CTG 
13981 AAGAGGAAGA ACAGAAGGAT GCCACAACTC TCCTGCTGGC TACTCTCC3U3 TGGTTTCATC 
14041 TTACTTCTGA TGGCATTTCC CTCTAGAAAG TGCTACTATC ATCCACACAT TTCTACCTGA 
14101 GACCACXCAA AGGACCCTCC CAAATTCTCT TCCTCTCTGA OTAGTCTCCA CACCT6TTAC 
14161 CACCATCCCA GAATTAAAAT CCTAACTGCA CTCTGGCGTG TGACTTGCCT CAGTCCTTGC 
14221 AATAAGAGTT GTTGGCAGTG CCAGGCGTGG TGGCGCACGC CTTTAATTCC AGCACTTGGG 
14281 AGGCAGAGGC AGGCGGATTT CTGAGTTCGA GGCCA6CCTG GTCTACAGAG TGAGTTCCaWS 
14341 GACAGCCAGG 6CXATACAGA GAAACCCTGT GTCGAAAAAC CAAAAAAAAA AAAAAAJ^TT 
14401 GTTCGC3VGAG TGTGGGTTAT ATACCAGGTG GAGATTTCAA ATGAGTGGCT GAAGCTGXAG 
14461 CCAGAAGGAA CTTAGAGGAT AGCTCATAAC TXAAAAAGAA ATGTAGAGAG TAGCAGAAAC 
14521 ATTGAGAGAG TGGGCACACA GCCACTGTGT GAATGTGGCA GAACACAATC CAGCCAGCTA 
14581 TACATCCaTA AGTGTAXATT GGCGCCATCC TGACTGATGA GACACAGGAA AAC3U3ATAGA 
14641 OGGGGTTAGG TGGCCATGQC CTTTCCTGCC TGCCTCTTCC TAAGGGTCAT CTCAAGACCT 
14701 TATGCTCTCT TAACTCTTCC ATTGCTACTT AGCTTCTAGA TATCACCTCC MgLTTA OTCT 
14761 CCTTGGGTAC ATCAGTGATC CTGGTGAXAT CCAGGGCTTC CTGATTCCAT CTTTGTCATA 
14821 GAGGCTGCAA CTAAAGAGGT CTTCTTAATA CTTCACACCX: TGATGCCAAA AGGAAGACAC 
14881 AGAAGTTCAC AGAGGTGAAG TGATTCATGT AGGACATACA GTGAGCAAGC ATCAGGGTCC 
14941 GGATTATCTG ACTCTACTCT AACTTTTATG TAAATGTGCT TTATGCCATT AACACTGTCA 
15001 TTCCTGTGCT TCAGCTCTGG GAGACTCCCA AGCACTCTTA GGC ACAAGC C ACAATIAAOG 
15061 GACTCTGACA CTCTGCATTG ATTAATTAGC ATGGTGGTCT CTATGTTTCC AGATTCATGA 
15121 TTGTTTCACT TTCCATATAG GCTATGAAGG GTGTGAGGAA ATTTTTTGGG GACAGAATTG 
15181 'gaggcaatcc ACCTCTCTCA GGAAGCCTCT ATCTGGAAAA GCTTACAACT CAGGGACAGT 
15241 AACTGTAGGC CCAGTCCTTO GTGTCCAAAA TGGGTTTTAT GGTTTGAATC-aXSCAAAGCCT 
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Fi^re 23 (continued): 

1S301 TCCATGTGCT CAAAGGTTTG AACATGGAGC CTCCTCCTGG TAACACTGTA TTGGAGGCTT 
15361 TTGAGACTGG ATGCTCTTTG GTCCCATGTT TTGCTACATC ATCTGTCAAG ATATGACCCA 
1542 1 GGCATGCTAC CAGCTACCAC AGACTATGCC TCTCCAGCTT TCATGTTCTC ^ICCMCM^ 
15481 TAGACTTGTA TCTCCTAAAA ATGGAATCAA AGCAAACTTT TCCTGCATTA AGTTTTTTTT 
15541 TTTCTGTTAA GTGTTTGGTC ACAGGGACAA GAAAACACTC AATAC AGATA ATTAGTACCA 
15€01 GAGTTGAGGT TCATTGCTCT AGCAAGTTGG ATCAAATTTT TAGGGCTTTG G AACTGAT TT 
15661 ATAAGAGACA TGTAGAAGAG TCTGAAGCTG TGGGCXACAG AAGTGTCACC AGTTTTTAAG 
15721 AATAGTTTAA TACACCATGG GAATTGTGAA AATCAGAATG CTCACACAAA GGCAGACAGG 
15781 AAAACGTGAG CATGTGGCGT GTCAGAGGGC ATAAGAAGGA ACCTAGGGGG AAATGAGCXA 
15841 GAAGCCATTC GGCTACGTTA GGGAACGTGT GTGGCTGTGC TTGGCCCATG CCCTGGCAAT 
15901 CTGAATGAGG CCAAATTTTA AAGGAGTGGA CTAACTCGAT TGTCAGAGAA AATATCAAGA 
15961 CAGACCACCA CTCAGGCTAT GCCGTGTtTG XGACCGACCA GCTA CTCTX A GCCAGCTCTA 
16021 TTGTGAAATT CCAGAGCAAT TATCAGAGCA TGAAGATACA TliChGTrThG TOAAGIAAGG 
16081 GGTGTGGGTC CCTAAGTGGA TGGTGCATAA ATCTATGTAG GTGATGCCTA AGTGACACTT. 
16141 GATAATCCAA AATATCAGCA ATGTGGAATG TCTTCCAAGG AGACCTGTAG ACACACATTT 
16201 TAGAACTTTC CTCATGGCTG TAATAAATAG CTAGCTAGAA ATCATTTCCT GAAGAGGTTA 
16261 GTCTCAGTTA CGGTrCCAGG GCAAACArTC AGTGATGGCA AGGAAGGCAT TGCAGTCAGG 
16321 AGCCAAAGGT CAGCTGGTCA CATTGCATCA AGAGTAGAGA GTCAGAGTGT GAGTAGAAAG 
16381 AGGATACAGG TTATAAAACC TCACTGTCCA CTCTCAGCAA TCCATTTTCT CCTAAAAGGC 
16441 TTTACCTTCT AAAGATTTTA GTCTTCAAAA CX:AGTACCAG TAGCCTGGGA ACAAAAGTTG 
16501 AAACAAATGA GCCTTTGTGG GGCATTTCAC ACTTAAAACA GGGCATCACC TAGGAGGAGC 
16561 CCTGTGTCCA GTAGGAAGTG TGGCCTCTGT GTCAGGAATG CTCAGGCTAA TAAGGGGTCC 
16621 TCTATCTGAG GGACCXTTATO AAGATTCAAC AAGTAGTTGT GAGAATTCCC TGTAAATGGA 
16681 TGCTACCAAT TTOACATTTG TAGACCTGCT ATTGTGTGCT TCTTTATTGG GCTCTCCCAT 
16741 CTCCCAACTT TCCAACCCAT ATTCCACATT AATCCCTTCC ACCACCATGC AACACTAGGT 
16801 AGGAGAGAAG GAAGGTTAGA AGAGAAAGTG GGTATAGATC TATTTAGACT ACTTCCTGCT 
16B61 GATTAGGGGC AAGTCCAATC mCATTGrCA GGATACCTCC AACCAGCAAC CAGCAAACCA 
16921 GCAAATCAGA AACAGCAAAA GCAGCCAACA AGGCAGCACT AACCAGCAGG ATTGGGGTC6 
16981 GTAGCGTGGG AGCAGTCACT ACTGGTCTTC TCATGGCTTT GGCATTAATA CTCTCTCAAG 
17041 AAATTCCGTA ATTTTTrCCC CACCACCTGA AATTCCGTAA TTTTAAATGC AAACTATCTA 
17101 CAGCTGGCAA AAATCACATC TCTCCTAGAG CACAAGACAA ATCATAGTTA CTGGCTATTT 
17161 GCAATCTGAA GCATCTCAAT ATCCCACACC TGGGATTAAA ACAAA AACAT ATTCACATCA 
17221 CATAACTGTT TTTTTTTTCC AATTTTTTAT TAGGTATTTT CTTTATTTAC ATTTCAAATG 
17281 CXATCCCGAA AGTCCCXTTAT ACCCTCCCAC CTCCCTGCTC CCCTACACAC CCACTCCCAC 
17341 TTTTTGACCC TGGAGTTCXX: CGGTACTGGG GCATAXAAAG TTTGCAAGAC CAAGGGGCXTT 
17401 CTCTTCCCAG TCATGGCCGA CTAAGCCATC TTCTGCTACA TATGCAGATA GAGACACGAG 
17461 CTCTGGGGCT ACTAGTTAGT TCATArrGTT GTTCCACCTA TAG GGTCGC A GACCCCTTCA 
17521 GCTCCTXGGG TACTTTGTCT AGCTOCTCCA CTGGGGGCTC TGTGTTTTAT CTAATAGATG 
17581 ACTGTGAGCA TXXaVClTCTG TATTTGACAG GCACTGGCCT AGCX3TCACAT GAGCCAGCTA 
17641 TATCAGGGTC CTTTCAGCAA AACCTTGCTG GCATGTGCaA TAGTGTCTGC GTTTGGTGGT 
17701 TCATTATGQG ATGGATCCAC TAGTTCTAGA GC 
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SEQUENCE LISTING 

<110> University of Guelph 
Forsberg, Cecil W. 
Golovan, Sergeui 
Phillips, John P. 

<120> Transgenic Animals Expressing Salivary Proteins 

<130> U Guelph - 03 

<140> 
<141> 

<1S0> 60/130,508 
<151> 1999-04-23 

<160> 7 

<170> Patentin Ver. 2.1 

<210> 1 
<211> 20623 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Lama2/APPA 
plasmid 



<40G> 1 



tcgagagtat 


ctttgtcagc 


tgtgcctcca 


acaaaggggt 


actgttgccc 


acatagaaag 


60 


atctaaacta 


attaattaat 


ccctcacccg 


caaatctttc 


agtcactaag 


ttagcacgat 


120 


tgttgaacaa 


gttctccaaa 


ggagagatac 


agatgagtgc 


gtatagggtg 


gacctggctg 


180 


ctgaggagac 


acctgcatct 


gactaagaag 


agccacggtg 


ttagttgaat 


ggtgtggagt 


240 


^gggtggttc 


tgtgggacag 


tagaaaatcg 


agaggcatgt 


gccgtttagt 


gaactgatgg 


300 


aagctacccc 


aaacgacaga 


gattgtcagt 


caggccaatc 


cgtttcgagt 


ttgatgggca 


360 


gccggacagt 


gagacagaca 


cacctactca 


gttggaggaa 


ggatgagaac 


aatggccagc 


420 


agggattgag 


agaccctgac 


aggcgcaagg 


ccctaacaca 


cacacctacc 


acctcacttg 


480 


acaaagctgc 


caaagaccaa 


agacttgttc 


tccattagaa 


atgacagctg 


gcttgacccg 


540 


acagcataat 


aagcagagtg 


tactctgatt 


ggagaacttt 


aatgtgtttc 


attcagtatt 


600 


ataaaaggac 


agtattacag 


attttgttgt 


acactgctgt 


tacatgtggg 


gcagtgtgtc 


660 


tttaagtagg 


gtaaagtact 


ctttaaaaat 


gggtcctaga 


tattttttcc 


tttaactcaa 


720 


gtctcttact 


gtttaaatga 


tttttatttt 


gtttaatatg 


gaggaaaaag 


aagcgtaaat 


780 


ggacaatata 


tatttagaga 


aagatggtta 


gctgtcagaa 


aaatatgcaa 


atcaaaatca 


840 


caccaagact 


gcagcacacc 


cctgtcagat 


ggctgtgatc 


aagaaaataa 


atgacaatga 


900 


gtggtggtga 


agatgtacta 


aagggaaaca 


cacacacaca 


cacacacaca 


cacacacaca 


960 


cacactggag 


caaccactgt 


ggaaatcagt 


atgaatggtc 


ctcaaaaacc 


tgaa^ataga 


1020 
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gcggggcgtg gtggcataca cttttattcc cagcactggg gaggcagagg caggtggatc 1080 
tctgagttcc aggccagcct ggtctatagc acaggttcta ggacagccag ggctacacag 1140 
aaaaaccctg ccttgattaa accaaaccaa accaaaccaa accaaaccaa accaaaccaa 1200 
accaaaccaa accaaaccag accaaaccaa aacactgaag atagaacttc agtattccat 1260 
tcctagatat atacccaatg gagactaagt cagcaagaca cctgcacagc catgttcact 1320 
actacactgt tcaccacagc caggctgtgg aaccagcctg agtgtccatg ataaatgaat 1380 
ggataggtaa ctttcaaggt aaatggactc tgctgtgtac atgcctcaca ttctgtttat 1440 
tcatttttct ttatgaggtg tccattcagg agtcacatgg tagttctatt ttcagtcttc 1500 
tgaagatact> acactggtcc ccacagttta cacttttatc agcagtgaat aagggttcct 1560 
ctatccttac catcatttgt tgtaattttt cttgatgacc ctctttctga cagggatagg 1620 
atgtaatatc agtgtgagga agtacaactt gttttctaag tatttattgg ccccttgcat 1680 
ttcttctttt gaaaactgtc ggttcctgac atctgctcag gtattcattg gatgttgttt 1740 
ctttggtgtt tgagttctta tgaattctag atgttaaatc cctgcctgtg gttctctccc 1800 
attctgtagg ctgcctcctc accctggcaa ttgttgtcct tgttttgcag aaacttttga 1860 
cttcatggaa tctcatttgt cagttttccc tcctctgcta tagcctgagc taatgcactg 1920 
gtttttacag agccctggtc tatgccttta tcctcctctg gcagcttcgg agtttcattt 1980 
cttacattta gatctttgat ccactttgaa caagttttgg agcagggtga gagatacgaa 2040 
tctagttcca ttcttccata tgtgatccta gtttacatag catcgttggt tgaagaggtt 2100 
ttattttatt tttaaataat gtgtcataaa aaacgaggtg gttgtagcag tgtggatttg 2160 
tttctttgtc ctttgatcta caggtcttgt tttgtgtcag tctcatgatg ttttattgct 2220 
atggctctgt catacagtct gaggtcaggt attgtgatat accttcagta ttgctccctc 2280 
agactcaggt ttgctttggc caggagtcat cttactcagt gctcttagag ctcccccagc 2340 
atgtagctgc tactattctt agttgataaa tcaggaaact ggggctcaga gagattaact 2400 
gtcttgaact acttctgggg aggtgaaacg tggagacact aaactgtgtt taccctgtac 2460 
tgctccagta gctgtcgggt gctgggctac agcaaagcac ctatactata tattactcag 2520 
gaggtggaaa aactcagcct cccttggggt tcccaagctc ccaggtgtcc agtcactgct 2580 
ggaaacctca tggagtctga aaggaagggt tgagggtaca tggggcagcg atgaggagcc 2640 
tggggctggg atctcccaaa cacctggata tccagatgcc actgggtcag ggggagttgg 2700 
gaacagagtt gggatgtcca tggacctgtg acaaggccag ggccaggggg aggataactc 2760 
tggctttact aatttgcgaa agtccttagc ttagcagcag ttgtctggga gcacagaggg 2820 
gccttctgta agaggctcag gcagtgccgc tctgtaggcg aaggtcttct ccatgttccc 2880 
catggtggtt cttgatgaaa gagacagtcc ttggctccaa actggtttat tgattgttca 2940 
ttgtggaaaa tgggtgcaca ccaccttctc agggtggacc agagatcaaa taccttttgc 3000 
agggaggaat atctgggaag ggacgcttac tggctaaacc ctcagggcct ctagatacat 3060 
cattagcatg gagaactctg ttctgggcta catgaccaca ggccacattt ccacaagcca 3120 
catgtgggaa gtgtggcaca tgttctaggc caggaatctg gtagggagcg tggagccacc 3180 
taccatx:cca gg t ggg tg cc tgggtgccag ggaccctgaa cccgctcaac cttaccaagt 3240 
ttcctggcag ggtccactgt cctacacaga agctggagga ggtgtgaggg ttgtgtcttt 3300 
gtggaatgtc ccatgctgct tggggctcag tttctccacc tgtacctcat tggtttgggt 3360 
ataaaaagtg gggatacttt attattctct gactcggtcc tgaggaaaaa gcatcgtggc 3420 
agtccaggaa ccacaccctg aggttcctgc actgaaggga ctccctaagt ctctggagtc 3480 
tctccccttc acagagctgc caaagtctag gttcttttga ggataacaga gccatgcttg 3540 
gtaagcagac aacagcattt gtttactcaa ccttcttttg tcagctccct cttcataaac 3600 
aagttgagac accatgctgg cttgaggaag acttctaaag ccagacaact gtgcaaggaa 3660 
gaagaagaag gggcaagtgg agttagcctg gatgtagccc tcaaagtctc cagagaccag 3720 
ccatgaaggc tcaagtggag ggcaagacct gcagcagcca agcatctggc aggagaggat 3780 
cctgggaacc cctctaccat gacacacatt cttcctgcag gtcacactta ataggccatt 3840 
tcttatttgg atctatcatg gtgttctgtg cgagattaat gaggtgttat gctgcgaaca 3900 
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gaaagttata taaaaacaag tccccccccc ttgtcactgc tgctaagaat gtagcagaaa 3960 
ttgtctcaag tgtctctcta atcagaaaca ataaaggtct ccttggattc aagccctcca 4020 
gtttcctcct tccttgctga gccttggaca cccatacaaa cctcctggat gctacagctc 4080 
tgggcagaga ctccaaggtg gggagagact gatggtacaa aagcaaaata cttgtttggg 4140 
ggtacaccca ctcctctgcc tgtgtggttc ctgcagtcag tcctgcagac aggccctcag 4200 
tgggtcttcc atgggcaaca cgcagaggga ggcaatggat gggaataccc acaccctggt 4260 
tagtttaccc cggccatgct ctctgctctt catccctcct ctgccctctg ccacggcttt 4320 
ctctgcagga atcatatctt catattggcc cacaggtgtt ctcctcaccc tagctatgat 4380 
gtttacttta gagtgacctt agcagggctg gtgggaatga gttctagaag gctcacggag 4440 
atgctaggga agaaacgtct tctaactact gaggttacta agttcctggt ggttgtctct 4500 
gcctttccct tgttaaagtc accttgaagt tagtgcagaa gaaatcagag cccagtcaca 4560 
gagtaaatat gg'ccctgaag atttcctttg agtgcecaga atccatgaca tttcaagagc 4620 
cctctttgta ccttaagtca tttggggttg tatcttctgc ttgatgtatg tgtgtgtgtt 4680 
tatcaaagag tgagatggtt acataagagg tgctctaaag gacagagagg atttgcaatt 4740 
gtggcatgtg acatcctcag gccttgctct ggtgccagga ggaactgatg cagaaaagag 4800 
taagaggtca tttcctggag gctgtcacta tagaggagat cttacacftgc attccctcct 4860 
ccaggccctg cctgaggata gacatgtgct gactgcaact gaaacagagg cttgggatgg 4920 
agagttaggt tcacagaagg gagggtggga gatggatgct tgctgggttc tgggtctcat 4980 
caccagctcc tgaccacccg gtcagcccat gtgcttattc catagctttc ttttgctatg 5040 
tttactcagt gtggtgtttg ttgggaccca gcagaagcca gtcccaggct gacagctgtg 5100 
gatacacagg gcagcatgag ggtcctcagc ctgaagcagt caggctggca gaagagaaag 5160 
accagcacac attccttcaa ccaactatgt cttgaaaaac aaacatatta tatcacatat 5220 
attgcattta tgagacagct aaaatgtact cgggtagcat gactccaggt ggggatatct 5280 
gcaagtgcca tgagtggcag agggacagcc aatgtgaggc aagaaggaat tctggctcaa 5340 
cacagcttag ctccctggtg ttggttcaaa ctttgagagt ttgaccacaa gcactttatt 5400 
tttgacatat ttaaacagag cacaactttg ggaaaaagtt ttcttatgaa aattatcaca 5460 
ataaagctta aggcatgact acattaaaat gcctttgcaa agtatatgtg ccctcttcca 5520 
caagaatggt tctattgact gagaaataat gttcaggata aagatccagg aagaaaagat 5580 
cagggaeaag taaaatacta aactcttttg caaagtacat agaccctctt tcataacaat 5640 
gggttctatt gactgacaag cactgctcag gagttgggaa agagtctagc ataagcacga 5700 
tagcctggag actctagtga ggtctagtct tacagacagc aaaaatcacc aggttacaaa 5760 
ctacatt:cat ttccagtttt ctgatcaggc acaggtatga atcccttctg ttgaagagaa 5820 
aagtccatgt gtttaaaata tctggtttct ccagtgctat tagcgagaag acttgagccc 5880 
tatacaactc ccacctggag tgacatcctg tcttcatggt atattacata cctagacacg 5940 
ctcat^ctcac agacttagga ctttgtcttc tgatctccat ttctgatccc acttccacct 6000 
ttgccttgat agtgtcattt tcttcactgc cttggtgaca accatgttat cctctgtgta 6060 
ttt:gagt9tt accattttca gattttacct gtatgcaaga tcacacagtc tttgtctttc 6120 
tgtct:ggatg catgctaatc tctacacaac aacccttccc cgtcactcag atcttcctcc 6180 
attaacacat acatggtgct gaagaggcta gggagcttcc cttcagtggg gagctagctg 6240 
gctattgggc ctttttgact gtccaggaag gcccccaatt gctgagacaa gaacttagat €300 
tcttcattat tgactctaac tcatgtatca agcagaagct aatgaatagt tatcaacagg 6360 
atcagaggtt ccagtgtaag acactttgac atgaaagaac ggaggaagga cagatggatg 6420 
cataaaagca ggaccactgc cccaggaagg tcctggaaac tgatgcaggg caaaggacag 6480 
gttataaacG aaatcttagg gagtcaggaa gagcacagag gagctcaacc aactgaccac 6540 
tgcttagggg ctaccaaccc aatcctccct gtgggaacag ctaagctatc agccaagggt 6600 
aataaacagg caggacctgt ggatgacatg gagagcatag ggaccctggg tccagccttt 6660 
agcacctgca ctctcaggat actccaccat tgtgtcttag agagcctagg gatactgggt 6720 
ccagcctttg gtaccttcac tctcagggta ccccatcact gtgtcttgga gagcctaggc 6780 
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accctgggtc cagccttcag tacctgcgct 
cgtctcttct tcctcttcct ccctttcatt 
tttcccctca caccctcact ctagttctcc 
tgtggtccct ccactttcct ttatctctca 
ccactatact tcaggggcca gctctagtga 
tccaacggct cagaggagcc agacc caeca 
cttcgaaagc tttcagcaaa tgctcaggga 
agttgagagt gggaaaggcc cttgcgtagg 
tctgtgtaat ccggattgac agggctcaga 
ttaggtgtta gtgtctttgc ttgcatgacc 
atgacagtag aggagggctt tgaatccctg 
getttgtggg tcttctagct ttcceaacag 
ctctaggccc aagagacatt gctttatgga 
aaagggaaat aaaaaaaaaa cttcagccgc 
tttagttgtg ataaaatggc aggtgcttca 
gtggttcaac tgtgagaact ggggtaagtg 
gtctctctcc ttccattctt tcttaaagga 
caatatggaa gttacagaag tgaaaaaagg 
atttttggtt gtcctgcaag gaggtctggg 
tgagggaagt ctggggagca gattccctaa 
ccagcctctc tggtcctagt agctttttcc 
agtaccagcc cattgcttaa gtgccagggt 
attagtagtt gtattggatg taggaagtcc 
actggagatt ttcaatgaga aatttatccc 
caacagccaa gtattttcca ttagaggaga 
aaggtgactt ggggcagtca gtacagactt 
caacaagggc cctctatgtt tgctatgtaa 
cagcacagcc tgcccagtgt gagggctctc 
gataacgagg aggtaagctg cagttcccag 
tcccaggtca tgtagccagc agtggacuiga 
ccattgatag catct:cctca caagtccctt 
tgccctttat actaaggaga tgcaggtaca 
ctggggatag ataccagcag caggcctgat 
tgtgttcctg gagtgtgaaa atccctactt 
tgacccatag ctggaaacag gattctcatt 
aaagagggtc tgcatacaga agacacgtgt 
agcttacaga attacaaggt cataatgtcc 
gccctaatga agatggggca gaagactgaa 
gagacccatc ctacaggcaa gcatcaattc 
tgcagacaga agcctagcat aactatcctc 
agaaaaagat atccacaggc aaacagtgga 
tgggaaggat taaaaaccct gaaggggata 
ctaagagacc tgtgggagct ctcagagact 
ggtccgaggc acctggcacg tgtgaagcag 
caataagcgg tagcctgact gcagtatcca 
agtgg^ggag gatgccccta atcctgcaga 
. aacccaccct ctctgagaag ggaatgggga 
ggacaaacaa gaacctcaaa taggtcaggc 



ctcaggacac cccaccattg tctcttgccc 6840 
gtctcttctc tgtttctttc ttgactctcc 6900 
ccttccctct ctgcatcacc ctattctctc 6960 
tgcttctctc ctccctcaaa tacttgtcac 7020 
caaagctgtt aatagcaaga ctctcagatc 7080 
agaactctct ccaggtccaa tttcaggttc 7140 
acatgccact aacaagaaga tgcaaattcc 7200 
tcccatcttc caggccaagg tcagaggggc 7260 
acaatgtttt gtttttaagg tttatttatt 7320 
ttatgtgcat catgtgtgtg caggttcctg 7380 
gggataggaa gttacaggaa attataagct 7440 
aagtgaatgc tcttcaccac tgagccatct 7500 
tataattgtg tgtgtgtgtc aacattgagg 7560 
taaggttgta cagtttcact aattgcfeact 7620 
acatttatat atacaaaaac ttccctgctg 7680 
ggtgagttct ctttttctgt ctctgtctct 7740 
aataaacatt gcagctgggt tatagctcat 7800 
cattgccttg gtgggtggtg ttaccagctg 7860 
gactggctgc tctgtctctg tctgtatgag 7920 
ccttcagcct ggcctggttc ctgagtgaac 7980 
aaacaggaat ctgagtggtg acagggaaca 8040 
tagtgagggc aggaagctgc catagctggg 8100 
tatcctggga cagctaatcc ttaatgcttc 8160 
acggcccata tggccccatc cttttgtctc 8220 
cttcctgtac acttgatgga tgctcattcc 8280 
gggatgacct ctgacagcct aacctctccc 8340 
tgtaatgtca gacattgtca ggagtgtccg 8400 
ataggtttcc cactgtctta tctacacagg 8460 
tctcacttca cagaggaaga gataacccca 8520 
atgaggattt gaactcaggt cttccaagtc 8580 
gccaccctca cgatgcctta gacacttgcc 8640 
aggggtttac ccatgtagca gctgaggcag 8700 
gtcaccactc taactccagc atccccagtc 8760 
aacaagattg tgcaacagtc cttggctctg 8820 
gatttgtgga acatggtggc agccagccaa 8860 
ggcaaggcca cagcagactc tgactacctt 8940 
tctgctttgg tcacctcatg ttaaggacag 9000 
ggaatggcca accaataact ggcccaactt 9060 
ctgacactac taatgatact ctgttatgct 9120 
cgagaggtcc acccagcaac tgactgaaac 9180 
tggaggtcag ggactattat gggagagctg 9240 
ggaaccccac aggaagacca acagagtcaa 9300 
gagccaccaa ccaaagagca tacacaggcc 9360 
acatgcagct cagtctccat gtaggtcctc 9420 
atccccaaca gggctgcata gtctggcctc 9480 
gacttgatga gtggagagct atccaggggg 9540 
tgggggaggg actctgtgaa gaggggacaa 9600 
cctaaaggct tgctaagtag cagtggccca 9660 
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gctctgtcct gttcctcagc ccaaggctca 
tttcatgggc ctaggacttg gtgaccagtt 
tacaagacta gggacattcc tgttctgctg 
cccaatcact ggatttctac ccttgcacaa 
atttcttcca ctgaaaaatg agactctcaa 
aaagaggtgt atctaacctg aatggcaagc 
taggaaggac aagaactgct cactgctgcc 
tctcctgctg gtgcacaggc cctgccttct 
cctcctgtca ggtcactggg aaacagatca 
taaaaaacag tggtcatggc tattcatagg 
gagctgtggg catgaaccac agtgttcgag 
ggtgagctca gtaaatgtga gctggaggtc 
gttcccaata tcctgaggag acgacacatc 
cctgagagtt accagtttct tatttgtgtg 
tgttgttcgt gtgtgagtgc aggtgcacat 
aactatcagg cgttgtcccc tcctactttt 
ttattccagg ggagcaagtc gctgttggct 
gacaatggag cctccatcag agtattactt 
ctctacccac ataggagttg ggtctccaca 
tcagttactt tatctggtaa agttcatcag 
catttgggaa cctggatcta gcaatacccc 
cagattataa ttccccccta gcttctatgg 
tccctttgga ccacagacat cctggcttgt. 
tggtagtatg tggctcacag agacacacta 
tgtgaatcat tcccccagtc ctgctgtttt 
tgaacctact tttcccctcc ttttattttt 
taatagcctt gggcagcatt aaacaaaatc 
ttgagtcagt gaaagagtga ggaatgtcat 
cagcagaggt ggttgaagct cttggacacg 
cactgagaac cacaattgaa caaacagaac 
caaaaaatga ctggctagct taggcaaata 
attgattcat tctggatctt ctttctagac 
caagtttgac aacatagggc tttgaacttg 
cctgggtgac ctttgggttg gaatatcttg 
tttagaacat ccaggaaata gggcttgatt 
tagaatccca ggagatcgtg cagagaaaca 
agcagtgagt agtgactcca ctaaactgag 
gggcaagaag cagagggaag gcactgtttg 
gaagacatag atgacattgt gtcagactaa 
agggatcaga ttttattcat caatgacttg 
gagtgtaggc tcaataatga cagaagagat 
aagtttggtg gagaaaggca gtggggaacc 
gctggcttca tagaaggtgt gaagttttgc 
aaagtgagca ggcaaggcag ggaatgtgtt 
ttatagatac acacacattt gaacctcatt 
tctcttaact gctaagcaca atgacttcca 
tcatttttca ttgtggctga ataaaattcc 
ctgagggcag gcatatcccc tggctccatt 



gctcccacct 


a tl tl t c ta t a 

w w w vy ^y ^ 


v>u w>UL>&yy^u 




caaacaatigg 


yy *»y ^y y<* 


dy du>d(.«ddud 


-7 / o u 


actiaticca^ta 


y v^v* uy auy Ud 


y g t ggaagga 






^ydyy^cctc 


ccagaaaccc 






uydCaduCdU 


caggcccacc 




s\cts\^ r'is c^r* a r^rt 
ciy aU'dy L>ciy ^ 


CdddUyCCug 


tatcaacctc 


10020 


v,. dy y cLy ^ 


ccduuugccg 


aaacagctgc 


10080 


CGlL.L.yCa.yCC 


acagccccuC 


cctgtctgaa 


10140 


clyGlL.yyaclCcl 


ggacagcucc 


tgatggtaaa 


10200 


yy ucuatycu 


tcttcagtcc 


acactgtgaa 


10260 


guagagccgg 


ggttictgaaa 


ttcacagtgg 


10320 


actcgtgaga 


cacacagtcc 


tgctgcttct 


10380 


caccucgccc 


agaggccaca 


gtctagttga 


10440 


tgtgtgtgtg 


tgtgtgtgtg 


tgtgtgtgtg 


10500 


atgatagcgt 


acacgttgag 


gtcagaggat 


10560 


cctcggactc 


tggagaacaa 


acatgggtcc 


10620 


gacacatctt 


gctcacatac 


attttaccta 


10680 


tagctcctca 


ccgatggcaa 


tgcaccacct 


10740 


cacccccaca 


cccccttcac 


caaaacgttt 


10800 


agaatgaagc 


cagtattaag 


aacatggaat 


10860 


accctagatg 


gagttgctga 


gttttcacct 


10920 


tttattctga 


aaccagggga 


actcgattcc 


10980 


gaattcacat 


gtcatctact 


gctaatccat 


11040 


cagtcatggc 


caatgtcaag 


gtaggacaga 


11100 


catgactaac 


cctcctcagc 


acagtgacca 


11160 


agaattgctg 


gaattttcta 


ttttgagaaa 


11220 


atctagaaag 


ctggtttaaa 


atacagatgg 


11280 


fcattggcccc 


tcacagaggc 


tggctcactc 


11340 


ggtcaggtgc 


ataggaaagg 


tngtctggga 


11400 


tgccggccct 




aatgagttct 


11460 


ct:^cgagcca 


acccaacaga 


acattcttcc 


11520 


aatactgaac 


tgaccccttg 


ttggcagtct 


11580 


gcacaaggt:c 


catcactgtc 


acccaagcat 


11640 


gctaacctta 


gatattttct 


ttggagtatc 


11700 


cccacccwgg 


gaccacaata 


taagtcaccc 


11760 


aggaucccuC 


* 1 j-mj-m >« J-LJ 1 1 JIJ Jl. _ll_ 1 

ccgugcgca^ 


cctccctcaa 


11820 


vvcccacccg 


agagtccaca 


ggaggctttg 


11880 


tgccggcaaa 


gccucgacuc 


taacaaattt 


11940 




gac uca wycg 


ggccccgc c c 


12000 


u ^Gly uy 


uagagaaagg 


ccfccccacug 


120o0 


ciy d U>clt«l«> WCC 


cccagggacu 


gtgctgctcc 


1^120 


ucLycluy uyCb 


cucuggggag 


ggggtctgaa 


12180 


ugaaacaucc 


aaacagaarc 


atagcttagg 


12240 


gcatatgtat 


atgtacatga 


atatattatg 


12300 


tgcagatgac 


agaaaatagg 


ttattttgcc 


12360 


gttccatcca 


tttcctgaaa 


tgccacaatt 


12420 


attgcagact 


gggccctact 


tcatccactc 


12480 


tcttacctat 


tgtgaagaga 


agtgcaactg 


12S40 
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tcttgttgaa aggcaagcgt gagagaggca ggcactaatt gtgggttttt gtttcttctt 12600 
cctgctatga ctctccattt gtcagaacca aagatcgata aaagccgcca ccatgaaagc 12 6 
catSaatc ccatttttat ctcttctgat tccgttaacc ccgcaatctg cattcgctca 12720 
aagtgagccg gagctgaagc tggaaagtgt ggtgattgtc agtcgtcatg gtgtgcgtgc 2780 
rcaaccaag gccacgcaac tgatgcagga tgtcacccca gacgcatggc caacctggcc 4 
ogtaalactg ggttggctga caccgcgcgg tggtgagcta atcgcctatc tcggacatta 12900 
fcaarccag cg-Sgtag ccgacggatt gctggcgaaa aagggctgcc cgcagtccgg x 
tcLLgcg attattgctg atgtcgacga gcgtacccgt aaaacaggcg aagccttcgc 13020 
eg cgggctg .cacctgact gtgcaaeaac cgtacatacc caggcagata cgtccagecc 0 
cgatccgtt! tttaatcctc taaaaactgg cgtttgccaa ctggataacg cgaacgtgac 1 140 
tgacgcgatc ctcagcaggg caggagggtc aattgctgac tttaccgggc a-ggcaaac l 
gg^getLgc gaactggaac gggtgcttaa ttttccgcaa tcaa.ct.gt gccttaaacg 1 260 
tgagaaacag gacgaaagct gttcattaac gcaggcatta ccatcggaac tcaaggtgag 13320 
cgccgacaa? gtctcattaa ccggtgcggt aagcctcgoa tcaatgctga cggagatatt l 3 o 
"LctgcaaLagcacagg gaa.gccgga gccggggtgg ggaagga.ca . . 440 

ccag^ggaac accttgctaa gtttgca^a cgc^caa tat^tgctac a^g g ^^^^^ 
agaggttgcc cgcagccgcg ccaccccgtt attagacctg atca a s ,,620 
cS!ccaLg cLaaacagg cgtatggtgt gacattaccc acttcagtgc tgtttatcgc 1 620 
^acacgat actaatctgg caaatctcgg cggcgcactg gagctcaact 99-9^^^- 
Jgtcagccg gataacacgc cgccaggtgg tgaactggtg tttgaacgct ^90^^<^^t 13740 
cggtcagc ay «.^^,„otttc actggtcttc cagactttac agcagatgcg 13800 

'^^^ r/o 

.ZJ^^ J. "C,.-. .™ .3«0 

~: =4: ™ »o.« 

tt!cttctga tggcatttcc ctctagaaag tgctactatc atccacacat ttctacctga 14100 

=i =r. s= 

Z^^t ^tggcagtg ccaggcgtgg tggcgcacgc ctttaattcc agcacttggg 14280 

Ht. ^ 
Li^ ^ ^= ^= 

I^^n«9 tgggoacc. ,=c.otgtgt g«tgtggca gaacactc ''•9<==»8ct. 1.5.0 
tSS a:S«"tt g^tcc t^act^taa gac^ a"^"^. »«• 
■»=aalta« tagccatggc ctttectscc tgoctcttcc tMgggtcat ctcagacot 14700 
2:««fco .tt^ct. ««.=t..a «~ ..at„3t t 

=ra := = ^ 

:= = = = =5 b: 

SLtgtgct tcagctctgg gagaCccca agcactctt. «cacgc= 
gaeJgL ^« c^^::^ 

ssrj^ '~ — ' rn!::r. 

ccagtccttg ,tgt=caa«, tgggttttat ggtttg-tc «^^t «30. 
caaggtttg «.=.tgg.9c ctcotcctM taacaotgw "««g=" 153.0 
t^SSgg a«cS«g gtccctgtt t«=t«=atc atc^tcg a«.:g.==ca 1S«0 
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ggcatgctac cagctaccac agactatgcc 
tagacttgta tctcctaaaa atggaatcaa 
tttctgttaa gtgtttggtc acagggacaa 
gagttgaggt tcattgctct agcaagttgg 
ataagagaca tgtagaagag tctgaagctg 
aatagtttaa tacaccatgg gaattgtgaa 
aaaacgtgag catgtggcgt gtgagagggc 
gaagccattc ggctacgtta gggaacgtgt 
ctgaatgagg ccaaatttta aaggagtgga 
cagaccacca ctcaggctat gccgtgtttg 
ttgtgaaatt ccagagcaat tatcagagca 
ggtgtgggtc cctaagtgga tggtgcataa 
gataatccaa aatatcagca atgtggaatg 
tagaactttg ctcatggctg taataaatag 
gtctgagtta cggttccagg gcaaacattc 
agccaaaggt cagctggtca cattgcatca 
aggatacagg ttataaaacc tcactgtcca 
tttaccttct aaagatttta gtcttcaaaa 
aaacaaatga gcctttgtgg ggcatttcac 
cctgtgtgca gtaggaagtg tggcctctgt 
tctatctgag ggaccctatg aagattcaac 
tgctaccaat ttgacatttg tagacctgct 
ctcccaactt tccaacccat attccacatt 
aggagagaag gaaggttaga agagaaagtg 
gattaggggc aagtccaatc gtcattgtca 
gcaaatcaga aacagcaaaa gcagccaaca 
gtagcgtggg agcagtcact actggtcttc 
aaattccgta attttttccc caccacctga 
cagctggcaa aaatcacatc tctcctagag 
gcaatctgaa gcatctcaat atcccacacc 
cataactgtt ttttttttcc aattttttat 
ctatcccgaa agtcccctat accctcccac 
tttttgaccc tggagttccc cggtactggg 
ctcttcccag tgatggccga ctaagccatc 
ctctgggggt actagttagt tcatattgtt 
gctccttggg tactttigtct agctcctcca 
actgtgagca tccacttctg tatttgacag 
tatcagggtc ctttcagcaa aaccttgctg 
tgattatggg atggatccac tagttctaga 
ttgttccctt tagtgagggt taattgcgcg 
tgtgtgaaat tgttatccgc tcacaattcc 
taaagcctgg ggtgcctaat gagtgagcta 
cgctttccag tcgggaaacc tgtcgtgcca 
gagaggcggt ttgcgtattg ggcgctcttc 
ggtcgttcgg ctgcggcgag cggtatcagc 
agaatcaggg gataacgcag gaaagaacat 
ccgtaaaaag gccgcgttgc tggcgttttt 
caaaaatcga cgctcaagtc agaggtggcg 



tctccagctt: 


tcatatitictc 


cccaccAticfa 


15480 


agcaaacttt 


tcctacatta 


aatttttttt 


15540 


aaaaacactc 


aa t!. a AGia t~ a 


at' t" AfTt*jir*r«a 


15600 


atzcaaatititt 




aa a t" aa t* t" t" 


15660 




aaflt*r»f" r'a 
day wy ir^ck^i^ 


ant* t" t- h ^ a an 


JL 9 r ^ V 






#^fT o a is /*• a 








a a a ■Hrta #*T/^t" a 
acLduyayCta 






u uyy CCCcI uy 


cccuggcaac 


1 con A 


\^ \m oleic \m («y a L. 


uyccagagaa 


aatatcaaga 




d a c c a 


^▼^^ ^ #^ i^fr* fa 

gcuacucuca 


gccagctcta 


1 /r o A 


^r*t a 9 a ^ A ^ ^ 

cga£igataca 


tacagtittEg 


tgaagtaagg 


16080 


atctatgtag 


gtigatgcctia 


agtgacactt 


16140 


tcttccaagg 


agacctgtag 


acacacattt: 


16200 


ctagctiagaa 


4w ^ ^ ^ A» M ^ 

atcacttcct 


gaagaggtta 


16260 


agtgatggca 


aggaaggca t 


tgcagtcagg 


16320 


agagtagaga 


gtcagagCgt: 


gagtiagaaag 


16380 


ctctcagcaa 




cctaaaaggc 


16440 


ccagtaccag 


tagcctggga 


acaaaagttg 


16500 


act^t^aaaaca 


gggcat:cacc 


taggaggagc 


16560 


gtcaggaatg 


ctcaggctaa 


taaggggtcc 


16620 


aagtagttgt 


gagaattccc 


tgtaaatgga 


16660 


attgtgtgct: 


tctttattgg 


gctctcccat 


16740 


aatcccttcc 


accaccatgc 


aacactaggt 


16800 


ggtatagatc 


tatttagact 


acttcctgct 


16860 


ggatacctcc 


aaccagcaac 


cagcaaacca 


16920 


aggcagcact 


aaccagcagg 


attggggtcg 


16980 


tcatggcttt 


ggcattaata 


ctctctcaag 


17040 


aa^tccgtaa 


ubucaaacgc 


aaactatcta 


17100 


cacaagacaa 


at: c a t: agt: t: a 


ctggctattt 


17160 


t;gggat:t;aaa 


acaaaaaca^ 


at:t:cacat:ca 


17220 


caggcacccc 




at^ttcaaatg 


17280 


cuccccgccc 


ccctzacacac 


CGactiCGcac 


17340 


gcacacaaag 


uucgcaagac 


caaggggcct: 


1 Til OA 

17400 


Ccccgccaca 


t^abgcagat^a 


gagacacgag 


17460 


gccccaccca 


tagggccgca 


gacGCCCCGa 


JL732U 


ctgggggcvc 


ugcgucccac 


ccaacagacg 


^ «VCO A 
1 /3D0 




ciycy^cacau 


gagccagcca 




ywcii»y uycaa 


uay uy u c ugc 


guuuygcggu 


J. / /UU 




wcy cy y uy y a 


y c b ccay w b u 


X / /OU 




t«\#ct wyy itfwCita 


cty w vy ^ w V ww 


17B20 




Vi> y ciy ^ w>y y GLd 


n^at*aaanl~n 
y wcioidy i»y 






di*^y ^»y w (>y\^ 


y ^ bwc*^ wy w w 


1794.0 


gcugcaccaa 


cgaaccggcG 


aacgGgcggg 


1 DnnA 
XoUOU 


cgcttcctcg 


ctcactgact 


cgctgcgctc 


18060 


tcactcaaag 


gcggtaatac 


ggttatccac 


18120 


gtgagcaaaa 


ggccagcaaa 


aggccaggaa 


18180 


ccataggctc 


cgccGGGCtg 


aGgagcatGa 


18240 


aaacccgaca 


ggactataaa 


gataccaggc 


18300 
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gtttccccct ggaagctccc tcgtgcgctc tcctgttccg accctgccgc ttaccggata 18360 
cctgtccgcc tttctccctt cgggaagcgt ggcgctttct catagctcac gctgtaggta 18420 
tctcagttcg gtgtaggtcg ttcgctccaa gctgggctgt gtgcacgaac cccccgttca 18480 
gcccgaccgc tgcgccttat ccggtaacta tcgtcttgag tccaacccgg taagacacga 18540 
cttatcgcca ctggcagcag ccactggtaa caggattagc agagcgaggt atgtaggcgg 18600 
tgctacagag ttcttgaagt ggtggcctaa ctacggctac actagaagga cagtatttgg 18660 
tatctgcgct ctgctgaagc cagttacctt cggaaaaaga gttggtagct cttgatccgg 18720 
caaacaaacc accgctggta gcggtggttt ttttgtttgc aagcagcaga ttacgcgcag 18780 
aaaaaaagga tctcaagaag atcctttgat cttttctacg gggtctgacg ctcagtggaa 18840 
cgaaaactca cgttaaggga ttttggtcat gagattatca aaaaggatct tcacctagat 18900 
ccttttaaat taaaaatgaa gttttaaatc aatctaaagt atatatgagt aaacttggtc 18960 
tgacagttac caatgcttaa tcagtgaggc acctatctca gcgatctgtc tatttcgttc 1S020 
atccatagtt gcctgactcc ccgtcgtgta gataactacg atacgggagg gcttaccatc 19080 
tggccccagt gctgcaatga taccgcgaga cccacgctca ccggctccag atttatcagc 19140 
aataaaccag ccagccggaa gggccgagcg cagaagtggt cctgcaactt tatccgcctc 19200 
catccagtct attaattgtt gccgggaagc tagagtaagt agttcgccag ttaatagttt 19260 
gcgcaacgtt gttgccattg ctacaggcat cgtggtgtca cgctcgtcgt ttggtatggc 19320 
ttcattcagc tccggttccc aacgatcaag gcgagttaca tgatccccca tgttgtgcaa 19380 
aaaagcggtt agctccttcg gtcctccgat cgttgtcaga agtaagttgg ccgcagtgtt 19440 
atcactcatg gttatggcag cactgcataa ttctcttact gtcatgccat ccgtaagatg 19500 
cttttctgtg actggtgagt actcaaccaa gtcattctga gaatagtgta tgcggcgacc 19560 
gagttgctct tgcccggcgt caatacggga taataccgcg ccacatagca gaactttaaa 19620 
agtgctcatc attggaaaac gttcttcggg gcgaaaactc tcaaggatct taccgctgtt 19680 
gagatccagt tcgatgtaac ccactcgtgc acccaactga tcttcagcat cttttacttt 19740 
caccagcgtt: tctgggtgag caaaaacagg aaggcaaaat gccgcaaaaa agggaataag 19800 
ggcgacacgg aaatgttgaa tactcatact cttccttttt caatattatt gaagcattta 19860 
tcagggttat tgtctcatga gcggatacat atttgaatgt atttagaaaa ataaacaaat 19920 
aggggttccg cgcacatttc cccgaaaagt gccacctaaa ttgtaagcgt taatattttg 19980 
ttaaaattcg cgttaaattt ttgttaaatc agctcatttt ttaaccaata ggccgaaatc 20040 
ggcaaaatcc cttataaatc aaaagaatag accgagatag ggttgagtgt tgttccagtt 20100 
tggaacaaga gtccactatt aaagaacgtg gactccaacg tcaaagggcg aaaaaccgtc 20160 
tatcagggcg atggcccact acgtgaacca tcaccctaat caagtttttt ggggtcgagg 20220 
tgccgtaaag cactaaatcg gaaccctaaa gggagccccc gatttagagc ttgacgggga 20280 
aagccggcga acgtggcgag aaaggaaggg aagaaagcga aaggagcggg cgctagggcg 20340 
ctggcaagtg tagcggtcac gctgcgcgta accaccacac ccgccgcgct taatgcgccg 20400 
ctacagggcg cgtcccattc gccattcagg ctgcgcaact gttgggaagg gcgatcggtg 20460 
cgggcctctt cgctattacg ccagctggcg aaagggggat gtgctgcaag gcgattaagt 20520 
tgggtaacgc cagggttttc ccagtcacga cgttgtaaaa cgacggccag tgagcgcgcg 20580 
taatacgact cactataggg cgaattgggt accgggcccc ccc 20623 



<210> 2 
<211> 6708 
<212> DNA 

<213> Airtiflclal Sequence 
<220> 

<223> Descrlptioa of Artificial Sequence: R15/APPA + 



8 



wo 00/64247 



PCT/CAOO/00430 



intron plasmid with pBLCAT3 vector 
<400> 2 

ggatcccctt tgctatgtag tttttaatgg aaattacaac ccatagtgtg ttgataaata 60 
gagagtcctg tttggtttaa gcaacctctg tttctcataa actccataaa aacaggaata 120 
ctctttgttt Gtagcataac caaaagattt agtgaattga aaacaatgtt cccttagagt 180 
ataggtctaa taaccccgaa aatattacca tgatactgag catttgtaag tatctcatag 240 
catgtagtat ccatagtcca tcaatgagag agacatttaa catgattttc attaatcagg 300 
tggaaaagac atgacaacat tcacaggcac tgcacagaac atagtggtcc accttgcaca 360 
tatttcacta aactaggttt atctattttg ttgctttctc taacatctct gcaatgaagc 420 
aggtcaacag tgccacatat cctttactta acctaaggaa cacaaaaaat tttctacata 480 
tatcctggtt agagagtgct taaaataagt tttccaagaa tggaaaagaa atgttctgac 540 
ttaacaatta agacagtatt tatttaaagc aagaaatatg aggcacacaa gaaaatattt 600 
tgggaagaaa ccatttggtg aacaatattt caaataaaaa tagacaaaca tagttaattg 660 
taaaacatat gtttgaccag cccttctttt caataggctt aatgtgaata aaatgttaaa 720 
gattctcttt gggtggctgc aaattgtcca cgaataagac aaaatataaa aataaggact 780 
gagtctcaca aaatgaaaag gaaatatatt cagaaagaga atcttgagag aatgtgttgt 840 
cacaaattaa agaaaacctg tggtgaatga catcctgagg cctgagctat tactgacatt 900 
taagataaag gtaactgtat acatttgtcc cattgagggg acaagaaagc tgctctcatg 960 
ttcagctcta taattcttgc cttaaacaac ttaaatagaa tgatttaaaa tatggagctg 1020 
tccatggacc tttgaaatat aaaatagtca agcaacttat caaggaatta cagattcctt 1080 
gatactaaca caggtaaatc ccacacgtgt tttgagacta catttgctgg gattttattg 1140 
atgtaatagg tcacatgttt ttcgggccaa tgttgctgtt attcggttac ttcaagagaa 1200 
tagtggcaac tgatgctatg tattctaggg gtttgaagtg atgtttcatg attgaaattt 1260 
gtaaaagaat aacatcatca ttcttaacaa tagaacatat aaagtcacac agaagtgaca 1320 
gtgtttaagc tgtactattg atcaaagaaa tttattacct tcagtttcaa tggaaataat 1380 
tactgataat acaaacatgt gtgaacacac actaatccta tccaaatgca cagtgataca 1440 
cagaaaatat tagcaagtag aatgcaatat ttatataacg attgtattta tcaatcaatt 1500 
gtatgtatca atatatgggc tattttctta cacatgattt tattcaaatt tactctaatc 1560 
attgttgaac catttagaaa aggcatactg gcaacttttc cttacctcat ccagctgggc 1620 
aaaagtccca gtgtggagta aaggatgcaa gatttcctgc tctgttaagt ataaaataat 1680 
agtatgaatt caaaggtgcc attcttctgc ttctagttat aaaggcagtg cttgcttctt 1740 
ccagcacaga tctggatctc gaggagcttg gcgagatttt caggagctaa ggaagctaaa 1800 
agccgccacc atgaaagcca tcttaatccc atttttatct cttctgattc cgttaacccc 1860 
gcaatctgca ttcgctcaga gtgagccgga gctgaagctg gaaagtgtgg tgattgtcag 1920 
tcgtcatggt gtgcgtgctc caaccaaggc cacgcaactg atgcaggatg tcaccccaga 1980 
cgcatggcca acctggccgg taaaactggg ttggctgaca ccgcgcggtg gtgagctaat 2040 
cgcctatctc ggacattacc aacgccagcg tctggtagcc gacggattgc tggcgaaaaa 2100 
gggctgcccg cagtctggtc aggtcgcgat tattgctgat gtcgacgagc gtacccgtaa 2160 
aacaggcgaa gccttcgccg ccgggctggc acctgactgt gcaataaccg tacataccca 2220 
ggcagatacg tccagtcccg atccgttatt taatcctcta aaaactggcg tttgccaact 2280 
ggataacgcg aacgtgactg acgcgatcct cagcagggca ggagggtcaa ttgctgactt 2340 
taccgggcat cggcaaacgg cgtttcgcga actggaacgg gtgcttaatt ttccgcaatc 2400 
aaacttgtgc cttaaacgtg agaaacagga cgaaagctgt tcattaacgc aggcattacc 2460 
atcggaactc aaggtgagcg ccgacaatgt ctcattaacc ggtgcggtaa gcctcgcatc 2520 
aatgctgacg gagatatttc tcctgcaaca agcacaggga atgccggagc cggggtgggg 2580 
aaggatcacc gattcacacc agtggaacac cttgctaagt ttgcataacg cgcaatttta 2640 
tttgctacaa cgcacgccag aggttgcccg cagccgcgcc accccg|ctat tagatttgat 2700 
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caagacagcg ttgacgcccc atccaccgca 
ttcagtgctg tttatcgccg gacacgatac 
gctcaactgg acgcttcccg gtcagccgga 
tgaacgctgg cgtcggctaa gcgataacag 
gactttacag cagatgcgtg ataaaacgcc 
gaaactgacc ctggcaggat gtgaagagcg 
ttttacgcaa atcgtgaatg aagcacgcat 
gttattggtg cccttaaacg cctggtgcta 
tggcagaaat tcgccggatc tttgtgaagg 
acaaactacc tacagagatt taaagctcta 
gtgttaaact actgattcta attgtttgtg 
aatgggagca gtggtggaat gcctttaatg 
catctagtga tgatgaggct actgctgact 
gaaaggtaga agaccccaag gactttcctt 
tgtttagtaa tagaactctt gcttgctttg 
tgctatacaa gaaaattatg gaaaaatatt 
ataatcataa catactgttt tttcttactc 
actatgctca aaaattgtgt acctttagct 
atttgatgta tagtgccttg actagagatc 
ttacttgctt taaaaaacct cccacacctc 
attgtitgttg ttaacttgtt tattgcagct 
acaaatttca caaataaagc atttttttca 
atcaatgtat cttatcatgt ctggatcgat 
tggtcatagc tgtttcctgt gtgaaattgt 
gccggaagca taaagtgtaa agcctggggt 
gcgttgcgct cactgcccgc tttccagtcg 
atcggccaac gcgcggggag aggcggtttg 
actgactcgc tgcgctcggt cgttcggctg 
gtaat:acggt tatccacaga atcaggggat 
cagcaaaagg ccaggaaccg taaaaaggcc 
ccccctgacg agcatcacaa aaatcgacgc 
ctataaagat: accaggcgtt tccccctgga 
ctgccgctta ccggatacct gtccgccttt 
tgctcacgct gtaggtatct cagttcggtg 
cacgaacccc ccgttcagcc cgaccgctgc 
aaccoggtaa gacacgactt atcgccactg 
gcgaggtatg taggcggtgc tacagagttc 
agaaggacag tatttggtat ctgcgctctg 
ggtagctctt gatccggcaa acaaaccacc 
cagcagatta cgcgcagaaa aaaaggatct 
tctgacgctc agtggaacga aaactcacgt 
aggatcttca cctagatcct tttaaattaa 
tatgagtaaa cttggtctga cagttaccaa 
atctgtctat ttcgttcatc catagttgcc 
cgggagggct taccatctgg ccccagtgct 
gctccagatt tatcagcaat aaaccagcca 
gcaactttat ccgcctccat ccagtctatt 
tcgccagtta atagtttgcg caacgttgtt 



aaaacaggcg 


tatggtgtqa 


cattacccac 


2760 


taatctggca 


aatctcggcg 


gcgcactgga 


2820 


taacacgccg 


ccaggtggtg 


aactggtgtt 


2660 


ccagtggatt 


caggtttcgc 


tggtcttcca 


2940 


gctgtcatta 


aatacgccgc 


ccggagaggt 


3000 


aaatgcgcag 


ggcatgtgtt 


cgttggcagg 


3060 


acccgcttgc 


agtttgtaag 


gtataaggca 


3120 


cgcctgaata 


«L9tgataata 


dgcggatgaa 


3180 


aaccttactt 


ctgtggtgtg 


acataattgg 


3240 


aggtaaatat 


aaaattttta 


agtgtataat 


3300 


tattttagat 


tccaacctat 


ggaactgatg 


3360 


aggaaaacct 


gttttgctca 


gaagaaatgc 


3420 


ctcaacattc 


tactcctcca 


aaaaagaaga 


3480 


cagaattgct 


aagttttttg 


agtcatgctg 


3540 


ctatttacac 


cacaaaggaa 


aaagctgcac 


3600 


ctgtaacctt 


tataagtagg 


cataacagtt 


3660 


cacacaggca 


tagagtgtct 


gctattaata 


3720 


ttttaatttg 


taaaggggtt 


aataaggaat 


3780 


ataatcagcc 


ataccacatt 


tgtagaggtt 


3840 


cccctgaacc 


tgaaacataa 


aatgaatgca 


3900 


tataatggtt 


acaaataaag 


caatagcatc 


3960 


ctgcattcta 


gttgtggttt 


gtccaaactc 


4020 


ccccgggtac 


cgagctcgaa 


ttcgtaatca 


4080 


tatccgctca 


caattccaca 


caacatacga 


4140 


gcctaatgag 


tgagctaact 


cacattaatt 


4200 


ggaaacctgt 


cgtgccagct 


gcattaatga 


4260 


cgtattgggc 


gctcttccgc 


ttcctcgctc 


4320 


cggcgagcgg 


tatcagctca 


ctcaaaggcg 


4360 


aacgcaggaa 


agaacatgtg 


agcaaaaggc 


4440 


gcgttgctgg 


cgtttttcca 


taggctccgc 


4500 


tcaagtcaga 


ggtggcgaaa 


cccgacagga 


4560 


agctccctcg 


tgcgctctcc 


tgttccgacc 


4620 


ctcccttcgg 


gaagcgtggc 


gctttctcaa 


4680 


taggtcgttc 


gctccaagct 


gggctgtgtg 


4740 


gccttatccg 


gtaactatcg 


tcttgagtcc 


4600 


gcagcagcca 


ctggtaacag 


gattagcaga 


4860 


ttgaagtggt 


ggcctaacta 


cggctacact 


4920 


ctgaagccag 


ttaccttcgg 


aaaaagagtt 


4980 


gctggtagcg 


gtggtttttt 


tgtttgcaag 


5040 


caagaagatc 


ctttgatctt 


ttctacgggg 


5100 


taagggattt 


tggtcatgag 


attatcaaaa 


5160 


aaatgaagtt 


ttaaatcaat 


ctaaagtata 


5220 


tgcttaatca 


gtgaggcacc 


tatctcagcg 


5280 


tgactccccg 


tcgtgtagat 


aactacgata 


5340 


gcaatgatac 


cgcgagaccc 


acgctcaccg 


5400 


gccggaaggg 


ccgagcgcag 


aagtggtcct 


5460 


aattgttgcc 


gggaagctag 


agtaagtagt 


5520 


gccattgcta 


caggcatcgt 


ggtgtcacgc 


5580 
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tcgtcgtttg gtatggcttc attcagctcc ggttcccaac gatcaaggcg agttacatga 5640 

tcccccatgt tgtgcaaaaa agcggttagc tccttcggtc ctccgatcgt tgtcagaagt 5700 

aagttggccg cagtgttatc actcatggtt atggcagcac tgcataattc tcttactgtc 5760 

atgccatccg taagatgctt ttctgtgact ggtgagtact caaccaagtc attctgagaa 5820 

tagtgtatgc ggcgaccgag ttgctcttgc ccggcgtcaa tacgggataa taccgcgcca 5880 

catagcagaa ctttaaaagt gctcatcatt ggaaaacgtt cttcggggcg aaaactctca 5940 

aggatcttac cgctgttgag atccagttcg atgtaaccca ctcgtgcacc caactgatct 6000 

tcagcatctt ttactttcac cagcgtttct gggtgagcaa aaacaggaag gcaaaatgcc 6060 

gcaaaaaagg gaataagggc gacacggaaa tgttgaatac tcatactctt cctttttcaa 6120 

tattattgaa gcatttatca gggttattgt ctcatgagcg gatacatatt tgaatgtatt 6180 

tagaaaaata aacaaatagg ggttccgcgc acatttcccc gaaaagtgcc acctgacgtc 6240 

taagaaacca ttattatcat gacattaacc tataaaaata ggcgtatcac gaggcccttt 6300 

cgtctcgcgc gtttcggtga tgacggtgaa aacctctgac acatgcagct cccggagacg 6360 

gtcacagctt gtctgtaagc ggatgccggg agcagacaag cccgtcaggg cgcgtcagcg 6420 

ggtgttggcg ggtgtcgggg ctggcttaac tatgcggcat cagagcagat tgtactgaga 6480 

gtgcaccata tgcggtgtga aataccgcac agatgcgtaa ggagaaaata ccgcatcagg 6540 

cgccattcgc cattcaggct gcgcaactgt tgggaagggc gatcggtgcg ggcctcttcg 6600 

ctattacgcc agctggcgaa agggggatgt gctgcaaggc gattaagttg ggtaacgcca 6660 

gggttttccc agtcacgacg ttgtaaaacg acggccagtg ccaagctt 6708 

<210>.3 
<211> 4060 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description o£ Artificial Sequence: R15/APPA + 
Intron transgene 



<400> 3 



ggatcccctt 


tgctatgtag 


tttttaatgg 


aaattacaac 


ccatagtgtg 


ttgataaata 


60 


gagagtcctg 


tttggtttaa 


gcaacctctg 


tttctcataa 


actccataaa 


aacaggaata 


120 


ctctttgttt 


ctagcataac 


caaaagattt 


agtgaattga 


aaacaatgtt 


cccttagagt 


180 


ataggtctaa 


taaccccgaa 


aatattacca 


tgatactgag 


catttgtaag 


tatctcatag 


240 


catgtagtat 


ccatagtcca 


tcaatgagag 


agacatttaa 


catgattttc 


attaatcagg 


300 


tggaaaagac 


atgacaacat 


tcacaggcac 


tgcacagaac 


atagtggtcc 


accttgcaca 


360 


tatttcacta 


aactaggttt 


atctattttg 


ttgctttctc 


taacatctct 


gcaatgaagc 


420 


aggtcaacag 


tgccacatat 


cctttactta 


acctaaggaa 


cacaaaaaat 


tttctacata 


480 


tatcctggtt 


agagagtgct 


taaaataagt 


tttccaagaa 


tggaaaagaa 


atgttctgac 


540 


ttaacaatta 


agacagtatt 


tatttaaagc 


aagaaatatg 


aggcacacaa 


gaaaatattt 


600 


tgggaagaaa 


ccatttggtg 


aacaatattt 


caaataaaaa 


tagacaaaca 


tagttaattg 


660 


taaaacatat 


gtttgaccag 


cccttctttt 


caataggctt 


aatgtgaata 


aaatgttaaa 


720 


gattctcttt 


gggtggctgc 


aaattgtcca 


cgaataagac 


aaaatataaa 


aataaggact 


780 


gagtctcaca 


aaatgaaaag 


gaaatatatt 


cagaaagaga 


atcttgagag 


aatgtgttgt 


840 


cacaaattaa 


agaaaacctg 


tggtgaatga 


catcctgagg 


cctgagctat 


tactgacatt 


soo 


taagataiaag 


gtaactgtat 


acatttgtcc 


cattgagggg 


acaagaaagc 


tgctctcatg 


960 


ttcagctcta 


taattcttgc 


cttaaacaac 


ttaaatagaa 


tgatttaaaa 


tatggagctg 


1020 
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tccatggacc tttgaaatat aaaatagtca agcaacttat caaggaatta cagattcctt 1080 
gatactaaca caggtaaatc ccacacgtgt tttgagacta catttgctgg gattttattg 1140 
atgtaatagg tcacatgttt ttcgggccaa tgttgctgtt attcggttac ttcaagagaa 1200 
tagtggcaac tgatgctatg tattctaggg gtttgaagtg atgtttcatg attgaaattt 1260 
gtaaaagaat aacatcatca ttcttaacaa tagaacatat aaagtcacac agaagtgaca 1320 
gtgtttaagc tgtactattg atcaaagaaa tttattacct tcagtttcaa tggaaataat 1380 
tactgataat acaaacatgt gtgaacacac actaatccta tccaaatgca cagtgataca 1440 
cagaaaatat tagcaagtag aatgcaatat ttatataacg attgtattta tcaatcaatt 1500 
gtatgtatca atatatgggc tattttctta cacatgattt tattcaaatt tactctaatc 1560 
attgttgaac catttagaaa aggcatactg gcaacttttc cttacctcat ccagctgggc 1620 
aaaagtccca gtgtggagta aaggatgcaa gatttcctgc tctgttaagt ataaaataat 1680 
agtatgaatt caaaggtgcc attcttctgc ttctagttat aaaggcagtg cttgcttctt 1740 
ccagcacaga tctggatctc gaggagcttg gcgagatttt caggagctaa ggaagctaaa 1800 
agccgccacc atgaaagcca tcttaatccc atttttatct cttctgattc cgttaacccc 1860 
gcaatctgca ttcgctcaga gtgagccgga gctgaagctg gaaagtgtgg tgattgtcag 1920 
tcgtcatggt gtgcgtgctc caaccaaggc cacgcaactg atgcaggatg tcaccccaga 1980 
cgcatggcca acctggccgg taaaactggg ttggctgaca ccgcgcggtg gtgagctaat 2040 
cgcctatctc ggacattacc aacgccagcg tctggtagcc gacggattgc tggcgaaaaa 2100 
gggctgcccg cagtctggtc aggtcgcgat tattgctgat gtcgacgagc gtacccgtaa 2160 
aacaggcgaa gccttcgccg ccgggctggc acctgactgt gcaataaccg tacataccca 2220 
ggcagat:acg tccagtcccg atccgttatt taatcctcta aaaactggcg tttgccaact 2280 
ggataacgcg aacgtgactg acgcgatcct cagcagggca ggagggtcaa ttgctgactt 2340 
taccgggcat cggcaaacgg cgtttcgcga actggaacgg gtgcttaatt ttccgcaatc 2400 
aaacttgtgc cttaaacgtg agaaacagga cgaaagctgt tcattaacgc aggcattacc 2460 
atcggaactc aaggtgagcg ccgacaatgt ctcattaacc ggtgcggtaa gcctcgcatc 2520 
aatgctgacg gagatatttc tcctgcaaca agcacaggga atgccggagc cggggtgggg 2580 
aaggatcacc gattcacacc agtggaacac cttgctaagt ttgcataacg cgcaatttta 2640 
tttgctacaa cgcacgccag aggttgcccg cagccgcgcc accccgttat tagatttgat 2700 
caagacagcg ttgacgcccc atccaccgca aaaacaggcg tatggtgtga cattacccac 2760 
ttcagtgctg tttatcgccg gacacgatac taatctggca aatctcggcg gcgcactgga 2820 
gctcaactgg acgcttcccg gtcagccgga taacacgccg ccaggtggtg aactggtgtt 2880 
tgaacgctgg cgtcggctaa gcgataacag ccagtggatt caggtttcgc tggtcttcca 2940 
gactttacag cagatgcgtg ataaaacgcc gctgtcatta aatacgccgc ccggagaggt 3000 
gaaactgacc ctggcaggat gtgaagagcg aaatgcgcag ggcatgtgtt cgttggcagg 3060 
ttttacgcaa atcgtgaatg aagcacgcat acccgcttgc agtttgtaag gtataaggca 3120 
gttattggtg cccttaaacg cctggtgcta cgcctgaata agtgataata agcggatgaa 3180 
tggcagaaat tcgccggatc tttgtgaagg aaccttactt ctgtggtgtg acataattgg 3240 
acaaactacc tacagagatt taaagctcta aggtaaatat aaaattttta agtgtataat 3300 
gtgttaaact actgattcta attgtttgtg tattttagat tccaacctat ggaactgatg 3360 
aatgggagca gtggtggaat gcctttaatg aggaaaacct gttttgctca gaagaaatgc 3420 
catctagtga tgatgaggct actgctgact ctcaacattc tactcctcca aaaaagaaga 3480 
gaaaggtaga agaccccaag gactttcctt cagaattgct aagttttttg agtcatgctg 3540 
tgtttagtaa tagaactctt gcttgctttg ctatttacac cacaaaggaa aaagctgcac 3600 
tgctatacaa gaaaattatg gaaaaatatt ctgtaacctt tataagtagg cataacagtt 3660 
ataatcataa catactgttt tttcttactc cacacaggca tagagtgtct gctattaata 3720 
actatgctca aaaattgtgt acctttagct ttttaatttg taaaggggtt aataaggaat 3780 
atttgatgta tagtgccttg actagagatc ataatcagcc ataccacatt tgtagaggtt 3840 
ttacttgctt taaaaaacct cccacacctc cccctgaacc tgaaacataa aatgaatgca 3900 
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attgttcfttg ttaacttgtt tattgcagct tataatggtt acaaataaag caatagcatc 
acaaatttca caaataaagc atttttttca ctgcattcta gttgtggttt gtccaaactc 
atcaatgtat cttatcatgt ctggatcgat ccccgggtac 

<210> 4 
<211> 6116 
<212> DKA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: R15/APPA 
plasmid with pBLCAT3 vector 

<400> 4 

ggatcccctt tgctatgtag tttttaatgg aaattacaac ccatagtgtg ttgataaata 60 
gagagtcctg tttggtttaa gcaacctctg tttctcataa actccataaa aacaggaata 120 
ctctttgttt ctagcataac caaaagattt agtgaattga aaacaatgtt cccttagagt 180 
ataggtctaa taaccccgaa aatattacca tgatactgag catttgtaag tatctcatag 240 
catgtagtat ccatagtcca tcaatgagag agacatttaa catgattttc attaatcagg 300 
tggaaaagac atgacaacat tcacaggcac tgcacagaac atagtggtcc accttgcaca 360 
tatttcacta aactaggttt atctattttg ttgctttctc taacatctct gcaatgaagc 420 
aggtcaacag tgccacatat cctttactta acctaaggaa cacaaaaaat tttctacata 480 
tatcctggtt agagagtgct taaaataagt tttccaagaa tggaaaagaa atgttctgac 540 
ttaacaatta agacagtatt tatttaaagc aagaaatatg aggcacacaa gaaaatattt 600 
tgggaagaaa ccatttggtg aacaatattt caaataaaaa tagacaaaca tagttaattg 660 
taaaacatat gtttgaccag cccttctttt caataggctt aatgtgaata aaatgttaaa 720 
gattctcttt gggtggctgc aaattgtcca cgaataagac aaaatataaa aataaggact 780 
gagtctcaca aaatgaaaag gaaatatatt cagaaagaga atcttgagag aatgtgttgt 840 
cacaaattaa agaaaacctg tggtgaatga catcctgagg cctgagctat tactgacatt 900 
taagataaag gtaactgtat acatttgtcc cattgagggg acaagaaagc tgctctcatg 960 
ttcagctcta taattcttgc cttaaacaac ttaaatagaa tgatttaaaa tatggagctg 1020 
tccatggacc tttgaaatat aaaatagtca agcaacttat caaggaatta cagattcctt 1080 
gatactaaca caggtaaatc ccacacgtgt tttgagacta catttgctgg gattttattg 1140 
atgtaatagg tcacatgttt ttcgggccaa tgttgctgtt attcggttac ttcaagagaa 1200 
tagtggcaac ^gatgctatg tattctaggg gtttgaagtg atgtttcatg attgaaattt 1260 
gtaaaagaat aacatcatca ttcttaacaa tagaacatat aaagtcacac agaagtgaca 1320 
gtgtttaagc tgtactattg atcaaagaaa tttattacct tcagtttcaa tggaaataat 1380 
tactgataat acaaacatgt gtgaacacac actaatccta tccaaatgca cagtgataca 1440 
cagaaaatat tagcaagtag aatgcaatat ttatataacg attgtattta tcaatcaatt 1500 
gtatgtatca atatatgggc tattttctta cacatgattt tattcaaatt tactctaatc 1560 
attgttgaac catttagaaa aggcatactg gcaacttttc cttacctcat ccagctgggc 1620 
aaaagtccca gtgtggagta aaggatgcaa gatttcctgc tctgttaagt ataaaataat 1680 
agtatgaatt caaaggtgcc attcttctgc ttctagttat aaaggcagtg cttgcttctt 1740 
ccagcacaga tctggatctc gaggagcttg gcgagatttt caggagctaa ggaagctaaa 1800 
agccgccacc atgaaagcca tcttaatccc atttttatct cttctgattc cgttaacccc 1860 
gcaatctgca ttcgctcaga gtgagccgga gctgaagctg gaaagtgtgg tgattgtcag 1920 
tcgtcatggt gtgcgtgctc caaccaaggc cacgcaactg atgcaggatg tcaccccaga 1980 



3960 
4020 
4060 
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cgcatggcca acctggccgg taaaactggg 
cgcctatctc ggacattacc aacgccagcg 
gggctgcccg cagtctggtc aggtcgcgat 
aacaggcgaa gccttcgccg ccgggctggc 
ggcagatacg tccagtcccg atccgttatt 
ggataacgcg aacgtgactg acgcgatcct 
taccgggcat cggcaaacgg cgtttcgcga 
aaacttgtgc cttaaacgtg agaaacagga 
atcggaactc aaggtgagcg ccgacaatgt 
aatgctgacg gagatatttc tcctgcaaca 
aaggatcacc gattcacacc agtggaacac 
tttgctacaa cgcacgccag aggttgcccg 
caagacagcg ttgacgcccc atccaccgca 
ttcagtgctg tttatcgccg gacacgatac 
gctcaactgg acgcttcccg gtcagccgga 
tgaacgctgg cgtcggctaa gcgataacag 
gactttacag cagatgcgtg ataaaacgcc 
gaaactgacc ctggcaggat gtgaagagcg 
ttttacgcaa atcgtgaatg aagcacgcat 
gttattggtg cccttaaacg cctggtgcta 
tggcagaaat tcgccggatc tttgtgaagg 
acaaactacc tacagagatt taaaaaacct 
aatgaatgca attgttgttg ttaacttgtt 
caatagcatc acaaatttca caaataaagc 
gtccaaactc atcaatgtat cttatcatgt 
ttcgtaatca tggtcatagc tgtttcctgt 
caacatacga gccggaagca taaagtgtaa 
cacattaatt gcgttgcgct cactgcccgc 
gcattaatga atcggccaac gcgcggggag 
ttcctcgctc actgactcgc tgcgctcggt 
ctcaaaggcg gtaatacggt tatccacaga 
agcaaaaggc cagcaaaagg ccaggaaccg 
taggctccgc ccccctgacg agcatcacaa 
cccgacagga ctataaagat accaggcgtt 
tgttccgacc ctgccgctta ccggatacct 
gctttctcaa tgctcacgct gtaggtatct 
gggctgtgtg cacgaacccc ccgttcagcc 
tcttgagtcc aacccggtaa gacacgactt 
gattagcaga gcgaggtatg taggcggtgc 
cggctacact agaaggacag tatttggtat 
aaaaagagtt ggtagctctt gatccggcaa 
tgtttgcaag cagcagatta cgcgcagaaa 
ttctacgggg tctgacgctc agtggaacga 
attatcaaaa aggatcttca cctagatcct 
ctaaagtata tatgagtaaa cttggtctga 
tatctcagcg atctgtctat ttcgttcatc 
aactacgata cgggagggct taccatctgg 
acgctcaccg gctccagatt tatcagcaat 



ttggctgaca ccgcgcggtg gtgagctaat 2040 
tctggtagcc gacggattgc tggcgaaaaa 2100 
tattgctgat gtcgacgagc gtacccgtaa 2160 
acctgactgt gcaataaccg tacataccca 2220 
taatcctcta aaaactggcg tttgccaact 2280 
cagcagggca ggagggtcaa ttgctgactt 2340 
actggaacgg gtgcttaatt ttccgcaatc 2400 
cgaaagctgt tcattaacgc aggcattacc 2460 
ctcattaacc ggtgcggtaa gcctcgcatc 2520 
agcacaggga atgccggagc cggggtgggg 2580 
cttgctaagt ttgcataacg cgcaatttta 2640 
cagccgcgcc accccgttat tagatttgat 2700 
aaaacaggcg tatggtgtga cattacccac 2760 
taatctggca aatctcggcg gcgcactgga 2820 
taacacgccg ccaggtggtg aactggtgtt 2880 
ccagtggatt caggtttcgc tggtcttcca 2940 
gctgtcatta aatacgccgc ccggagaggt 3000 
aaatgcgcag ggcatgtgtt cgttggcagg 3060 
acccgcttgc agtttgtaag gtataaggca 3120 
cgcctgaata agtgataata agcggatgaa 3180 
aaccttactt ctgtggtgtg acataattgg 3240 
cccacacctc cccctgaacc tgaaacataa 3300 
tattgcagct tataatggtt acaaataaag 3360 
atttttttca ctgcattcta gttgtggttt 3420 
ctggatcgat ccccgggtac cgagctcgaa 3480 
gtgaaattgt tatccgctca caattccaca 3540 
agcctggggt gcctaatgag tgagctaact 3600 
tttccagtcg ggaaacctgt cgtgccagct 3660 
aggcggtttg cgtattgggc gctcttccgc 3720 
cgttcggctg cggcgagcgg tatcagctca 3780 
atcaggggat aacgcaggaa agaacatgtg 3640 
taaaaaggcc gcgttgctgg cgtttttcca 3900 
aaatcgacgc tcaagtcaga ggtggcgaaa 3960 
tccccctgga agctccctcg tgcgctctcc 4020 
gtccgccttt ctcccttcgg gaagcgtggc 4080 
cagttcggtg taggtcgttc gctccaagct 4140 
cgaccgctgc gccttatccg gtaactatcg 4200 
atcgccactg gcagcagcca ctggtaacag 4260 
tacagagttc ttgaagtggt ggcctaacta 4320 
ctgcgctctg ctgaagccag ttaccttcgg 4380 
acaaaccacc gctggtagcg gtggtttttt 4440 
aaaaggatct caagaagatc ctttgatctt 4500 
aaactcacgt taagggattt tggtcatgag 4560 
tttaaattaa aaatgaagtt ttaaatcaat 4620 
cagttaccaa tgcttaatca gtgaggcacc 4680 
catagttgcc tgactccccg tcgtgtagat 4740 
ccccagtgct gcaatgatac cgcgagaccc 4800 
aaaccagcca gccggaaggg ccgagcgcag 4860 
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aagtggtcct gcaactttat ccgcctccat 
agtaagtagt tcgccagtta atagtttgcg 
ggtgtcacgc tcgtcgtttg gtatggcttc 
agttacatga tcccccatgt tgtgcaaaaa 
tgtcagaagt aagttggccg cagtgttatc 
tcttactgtc atgccatccg taagatgctt 
attctgagaa tagtgtatgc ggcgaccgag 
taccgcgcca catagcagaa ctttaaaagt 
aaaactctca aggatcttac cgctgttgag 
caactgatct tcagcatctt ttactttcac 
gcaaaatgcc gcaaaaaagg gaataagggc 
cctttttcaa tattattgaa gcatttatca 
tgaatgtatt tagaaaaata aacaaatagg 
acctgacgtc taagaaacca ttattatcat 
gaggcccttt cgtctcgcgc gtttcggtga 
cccggagacg gtcacagctt gtctgtaagc 
cgcgtcagcg ggtgttggcg ggtgtcgggg 
tgtactgaga gtgcaccata tgcggtgtga 
ccgcatcagg cgccattcgc cattcaggct 
ggcctcttcg ctattacgcc agctggcgaa 
ggtaacgcca gggttttccc agtcacgacg 



ccagtctatt aattgttgcc gggaagctag 4920 
caacgttgtt gccattgcta caggcatcgt 4980 
attcagctcc ggttcccaac gatcaaggcg 5040 
agcggttagc tccttcggtc ctccgatcgt 5100 
actcatggtt atggcagcac tgcataattc 5160 
ttctgtgact ggtgagtact caaccaagtc 5220 
ttgctcttgc ccggcgtcaa tacgggataa 5280 
gctcatcatt ggaaaacgtt cttcggggcg 5340 
atccagttcg atgtaaccca ctcgtgcacc 5400 
cagcgtttct gggtgagcaa aaacaggaag 5460 
gacacggaaa tgttgaatac tcatactctt 5520 
gggttattgt ctcatgagcg gatacatatt 5580 
ggttccgcgc acatttcccc gaaaagtgcc 5640 
gacattaacc tataaaaata ggcgtatcac 5700 
tgacggtgaa aacctctgac acatgcagct 5760 
ggatgccggg agcagacaag cccgtcaggg 5820 
ctggcttaac tatgcggcat: cagagcagat 5880 
aataccgcac agatgcgtaa ggagaaaata 5940 
gcgcaactgt tgggaagggc gatcggtgcg 6000 
agggggatgt gctgcaaggc gattaagttg 6060 
ttgtaaaacg acggccagtg ccaagc 6116 



<210> 5 
<211> 3.470 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: R15/APPA 
transgene 

<400> 5 

ggatcccctt tgctatgtag tttttaatgg aaattacaac ccatagtgtg ttgataaata 60 
gagagtcctg tttggtttaa gcaacctctg tttctcataa actccataaa aacaggaata 120 
ctctttgttt ctagcataac caaaagattt agtgaattga aaacaatgtt cccttagagt 160 
ataggtctaa taaccccgaa aatattacca tgatactgag catttgtaag tatctcatag 240 
catgtagtat ccatagtcca tcaatgagag agacatttaa catgattttc attaatcagg 300 
tggaaaagac atgacaacat tcacaggcac tgcacagaac atagtggtcc accttgcaca 360 
tatttcacta aactaggttt atctattttg ttgctttctc taacatctct gcaatgaagc 420 
aggtcaacag tgccacatat cctttactta acctaaggaa cacaaaaaat tttctacata 480 
tatcctggtt agagagtgct taaaataagt tttccaagaa tggaaaagaa atgttctgac 540 
ttaacaatta agacagtatt tatttaaagc aagaaatatg aggcacacaa gaaaatattt 600 
tgggaagaaa ccatttggtg aacaatattt caaataaaaa tagacaaaca tagttaattg 660 
taaaacatat gtttgaccag cccttctttt caataggctt aatgtgaata aaatgttaaa 720 
gattctcttt gggtggctgc aaattgtcca cgaataagac aaaatataaa aataaggact 780 
gagtctcaca aaatgaaaag gaaatatatt cagaaagaga atcttgagag aatgtgttgt 840 
cacaaattaa agaaaacctg tggtgaatga catcctgagg cctgagctat tactgacatt 900 
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taagataaag gtaactgtat acatttgtcc cattgagggg acaagaaagc tgctctcatg 960 
ttcagctcta taattcttgc cttaaacaac ttaaatagaa tgatttaaaa tatggagctg 1020 
tccatggacc tttgaaatat aaaatagtca agcaacttat caaggaatta cagattcctt 1080 
gatactaaca caggtaaatc ccacacgtgt tttgagacta catttgctgg gattttattg 1140 
atgtaatagg tcacatgttt ttcgggccaa tgttgctgtt attcggttac ttcaagagaa 1200 
tagtggcaac tgatgctatg tattctaggg gtttgaagtg atgtttcatg attgaaattt 1260 
gtaaaagaat aacatcatca ttcttaacaa tagaacatat aaagtcacac agaagtgaca 1320 
gtgtttaagc tgtactattg atcaaagaaa tttattacct tcagtttcaa tggaaataat 1380 
tactgataat acaaacatgt gtgaacacac actaatccta tccaaatgca cagtgataca 1440 
cagaaaatat tagcaagtag aatgcaatat ttatataacg attgtattta tcaatcaatt 1500 
gtatgtatca atatatgggc tattttctta cacatgattt tattcaaatt tactctaatc 1560 
attgttgaac catttagaaa aggcatactg gcaacttttc cttacctcat ccagctgggc 1620 
aaaagtccca gtgtggagta aaggatgcaa gatttcctgc tctgttaagt ataaaataat 1680 
agtatgaatt caaaggtgcc attcttctgc ttctagttat aaaggcagtg cttgcttctt 1740 
ccagcacaga tctggatctc gaggagcttg gcgagatttt caggagctaa ggaagctaaa 1800 
agccgccacc atgaaagcca tcttaatccc atttttatct cttctgattc cgttaacccc 1860 
gcaatctgca ttcgctcaga gtgagccgga gctgaagctg gaaagtgtgg tgattgtcag 1920 
tcgtcatggt gtgcgtgctc caaccaaggc cacgcaactg atgcaggatg tcaccccaga 1980 
cgcatggcca acctggccgg taaaactggg ttggctgaca ccgcgcggtg gtgagctaat 2040 
cgcctatctc ggacattacc aacgccagcg tctggtagcc gacggattgc tggcgaaaaa 2100 
gggctgcccg cagtctggtc aggtcgcgat tattgctgat gtcgacgagc gtacccgtaa 2160 
aacaggcgaa gccttcgccg ccgggctggc acctgactgt gcaataaccg tacataccca 2220 
ggcagatacg tccagtcccg atccgttatt taatcctcta aaaactggcg tttgccaact 2280 
ggataacgcg aacgtgactg acgcgatcct cagcagggca ggagggtcaa ttgctgactt 2340 
taccgggcat cggcaaacgg cgtttcgcga actggaacgg gtgcttaatt ttccgcaatc 2400 
aaacttgtgc cttaaacgtg agaaacagga cgaaagctgt tcattaacgc aggcattacc 2460 
atcggaactc aaggtgagcg ccgacaatgt ctcattaacc ggtgcggtaa gcctcgcatc 2520 
aatgctgacg gagatatttc tcctgcaaca agcacaggga atgccggagc cggggtgggg 2580 
aaggatcacc gattcacacc agtggaacac cttgctaagt ttgcataacg cgcaatttta 2640 
tttgctacaa cgcacgccag aggttgcccg cagccgcgcc accccgttat tagatttgat 2700 
caagacagcg ttgacgcccc atccaccgca aaaacaggcg tatggtgtga cattacccac 2760 
ttcagtgctg tttatcgccg gacacgatac taatctggca aatctcggcg gcgcactgga 2820 
gctcaactgg acgcttcccg gtcagccgga taacacgccg ccaggtggtg aactggtgtt 2880 
tgaacgctgg cgtcggctaa gcgataacag ccagtggatt caggtttcgc tggtcttcca 2940 
gactttacag cagatgcgtg ataaaacgcc gctgtcatta aatacgccgc ccggagaggt 3000 
gaaactgacc ctggcaggat gtgaagagcg aaatgcgcag ggcatgtgtt cgttggcagg 3060 
ttttacgcaa atcgtgaatg aagcacgcat acccgcttgc agtttgtaag gtataaggca 3120 
gttattggtg cccttaaacg cctggtgcta cgcctgaata agtgataata agcggatgaa 3180 
tggcagaaat tcgccggatc tttgtgaagg aacct-tactt ctgtggtgtg acataattgg 3240 
acaaactacc tacagagatt taaaaaacct cccacacctc cccctgaacc tgaaacataa 3300 
aatgaatgca attgttgttg ttaacttgtt tattgcagct tataatggtt acaaataaag 3360 
caatagcatc acaaatttca caaataaagc atttttttca ctgcattcta gttgtggttt 3420 
gtccaaactc atcaatgtat cttatcatgt ctggatcgat ccccgggtac 3470 



<210> 6 
<211> 5421 
<2I2> DNA 
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<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: SV40/APPA + 
intron plasmid 

<400> 6 

cgagattttc aggagctaag gaagctaaaa gccgccacca tgaaagccat cttaatccca 60 
tttttatctc ttctgattcc gttaaccccg caatctgcat tcgctcagag tgagccggag 120 
ctgaagctgg aaagtgtggt gattgtcagt cgtcatggtg tgcgtgctcc aaccaaggcc 180 
acgcaactga tgcaggatgt caccccagac gcatggccaa cctggccggt aaaactgggt 240 
tggctgacac cgcgnggtgg tgagctaatc gcctafcctcg gacattacca acgccagcgt 300 
ctggtagccg acggattgct ggcgaaaaag ggctgcccgc agtctggtca ggtcgcgatt 360 
attgctgatg tcgacgagcg tacccgtaaa acaggcgaag ccttcgccgc cgggctggca 420 
cctgactgtg caataaccgt acatacccag gcagatacgt ccagtcccga tccgttattt 480 
aatcctctaa aaactggcgt ttgccaactg gataacgcga acgtgactga cgcgatcctc 540 
agcagggcag gagggtcaat tgctgacttt accgggcatc ggcaaacggc gtttcgcgaa 600 
ctggaacggg tgcttaattt tccgcaatca aacttgtgcc ttaaacgtga gaaacaggac 660 
gaaagctgtt cattaacgca ggcattacca tcggaactca aggtgagcgc cgacaatgtc 720 
tcattaaccg gtgcggtaag cctcgcatca atgctgacgg agatatttct cctgcaacaa 780 
gcacagggaa tgccggagcc ggggtgggga aggatcaccg attcacacca gtggaacacc 840 
ttgctaagtt tgcataacgc gcaattttat ttgctacaac gcacgccaga ggttgcccgc 900 
agccgcgcca ccccgttatt agatttgatc aagacagcgt tgacgcccca ccaccgcaaa 960 
aacaggcgta tggtgtgaca ttacccactt cagtgctgtt tatcgccgga cacgatacta 1020 
atctggcaaa tctcggcggc gcactggagc tcaactggac gcttcccggt cagccggata 1080 
acacgccgcc aggtggtgaa ctggtgtttg aacgctggcg tcggctaagc gataacagcc 1140 
agtggattca ggtttcgctg gtcttccaga ctttacagca gatgcgtgat aaaacgccgc 1200 
tgtcattaaa tacgccgccc ggagaggtga aactgaccct ggcaggatgt gaagagcgaa 1260 
atgcgcag^ catgtgttcg ttggcaggtt ttacgcaaat cgtgaatgaa gcacgcatac 1320 
ccgcttgcag tttgtaaggc agttattggt gcccttaaac gcctggtgct acgcctgaat 1380 
aagtgataat aagcggatga atggcagaaa ttcgccggat ctttgtgaag gaaccttact 1440 
tctgtggtgt gacataattg gacaaactac ctacagagat ttaaagctct aaggtaaata 1500 
taaaattttt aagtgtataa tgtgttaaac tactgattct aattgtttgt gtattttaga 1560 
ttccaaccta tggaactgat gaatgggagc agtggtggaa tgcctttaat gaggaaaacc 1620 
tgttttgctc agaagaaatg ccatctagtg atgatgaggc tactgctgac tctcaacatt 1680 
ctactcctcc aaaaaagaag agasuiggtag aagaccccaa ggactttcct tcagaattgc 1740 
taagtttttt gagtcatgct gtgtttagta atagaactct tgcttgcttt gctatttaca 1800 
ccacaaagga aaaagctgca ctgctataca agaaaattat ggaaaaatat tctgtaacct 1860 
ttataagtag gcataacagt tataatcata acatactgtt ttttcttact ccacacaggc 1920 
atagagtgtc tgctattaat aactatgctc aaaaattgtg tacctttagc tttttaattt 1980 
gtaaaggggt taataaggaa tatttgatgt atagtgcctt gactagagat cataatcagc 2040 
cataccacat ttgtagaggt tttacttgct ttaaaaaacc tcccacacct ccccctgaac 2100 
ctgaaacata aaatgaatgc aattgttgtt gttaacttgt ttattgcagc ttataatggt 2160 
tacaaataaa gcaatagcat cacaaatttc acaaataaag catttttttc actgcattct 2220 
agttgtggtt tgtccaaact catcaatgta tcttatcatg tctggatcga tccccgggta 2280 
ccgagctcga attcgtaatc atggtcatag ctgtttcctg tgtgaaattg ttatccgctc 2340 
acaattccac acaacatacg agccggaagc ataaagtgta aagcctgggg tgcctaatga 2400 
gtgagctaac tcacattaat tgcgttgcgc tcactgcccg ctttccagtc gggaaacctg 2460 
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tcgtgccagc tgcattaatg aatcggccaa 
cgctcttccg cttcctcgct cactgactcg 
gtatcagctc actcaaaggc ggtaatacgg 
aagaacatgt gagcaaaagg ccagcaaaag 
gcgtttttcc ataggctccg cccccctgac 
aggtggcgaa acccgacagg actataaaga 
gtgcgctctc ctgttccgac cctgccgctt 
ggaagcgtgg cgctttctca atgctcacgc 
cgctccaagc tgggctgtgt gcacgaaccc 
ggtaactatc gtcttgagtc caacccggta 
actggtaaca ggattagcag agcgaggtat 
tggcctaact acggctacac tagaaggaca 
gttaccttcg gaaaaagagt tggtagctct 
ggtggttttt ttgtttgcaa gcagcagatt 
cctttgatct tttctacggg gtctgacgct 
ttggtcatga gattatcaaa aaggatcttc 
tttaaatcaa tctaaagtat atatgagtaa 
agtgaggcac ctatctcagc gatctgtcta 
gtcgtgtaga taactacgat acgggagggc 
ccgcgagacc cacgctcacc ggctccagat 
gccgagcgca gaagtggtcc tgcaacttta 
cgggaagcta gagtaagtag ttcgccagtt 
acaggcatcg tggtgtcacg ctcgtcgttt 
cgatcaaggc gagttacatg atcccccatg 
cctccgatcg ttgtcagaag taagttggcc 
ctgcataatt ctcttactgt catgccatcc 
tcaaccaagt catt:ctgaga atagtgtatg 
atacgggata ataccgcgcc acatagcaga 
tcttcggggc gaaaactctc aaggatctta 
actcgtgcac ccaactgatc ttcagcatct 
aaaacaggaa ggcaaaatgc cgcaaaaaag 
ctcatactct tcctttttca atattattga 
ggatacatat ttgaatgtat ttagaaaaat 
cgaaaagtgc cacctgacgt ctaagaaacc 
aggcgtatca cgaggccctt tcgtctcgcg 
cacatgcagc tccoggagac ggtcacagct 
gcccgtcagg gcgcgtcagc gggtgttggc 
tcagagcaga ttgtactgag agtgcaccat 
aggagaaaat accgcatcag gcgccattcg 
cgatcggtgc gggcctcttc gctattacgc 
cgattaagtt gggtaacgcc agggttttcc 
gccaagcttt acactttatg cttccggctc 
aatttcacac aggaaacagc tatgaccatg 
aaataacctc tgaaagagga acttggttag 
ggaatgtgtg tcagttaggg tgtggaaagt 
aaagcatgca tctcaattag tcagcaacca 
gcagaagtat gcaaagcatg catctcaatt 
cgcccatccc gcccctaact ccgcccagtt 



cgcgcgggga gaggcggttt gcgtattggg 2520 
ctgcgctcgg tcgttcggct gcggcgagcg 2580 
ttatccacag aatcagggga taacgcagga 2640 
gccaggaacc gtaaaaaggc cgcgttgctg 2700 
gagcatcaca aaaatcgacg ctcaagtcag 2760 
taccaggcgt ttccccctgg aagctccctc 2820 
accggatacc tgtccgcctt tctcccttcg 2880 
tgtaggtatc tcagttcggt gtaggtcgtt 2940 
cccgttcagc ccgaccgctg cgccttatcc 3000 
agacacgact tatcgccact ggcagcagcc 3060 
gtaggcggtg ctacagagtt cttgaagtgg 3120 
gtatttggta tctgcgctct gctgaagcca 3180 
tgatccggca aacaaaccac cgctggtagc 3240 
acgcgcagaa aaaaaggatc tcaagaagat 3300 
cagtggaacg aaaactcacg ttaagggatt 3360 
acctagatcc ttttaaatta aaaatgaagt 3420 
acttggtctg acagttacca atgcttaatc 3480 
tttcgttcat ccatagttgc ctgactcccc 3540 
ttaccatctg gccccagtgc tgcaatgata 3600 
ttatcagcaa taaaccagcc agccggaagg 3660 
tccgcctcca tccagtctat taattgttgc 3720 
aatagtttgc gcaacgttgt tgccattgct 3780 
ggtatggctt cattcagctc cggttcccaa 3840 
ttgtgcaaaa aagcggttag ctccttcggt 3900 
gcagtgttat cactcatggt tatggcagca 3960 
gtaagatgct tttctgtgac tggtgagtac 4020 
cggcgaccga gttgctcttg cccggcgtca 4080 
actttaaaag tgctcatcat tggaaaacgt 4140 
ccgctgttga gatccagttc gatgtaaccc 4200 
tttactttca ccagcgtttc tgggtgagca 4260 
ggaataaggg cgacacggaa atgttgaata 4320 
agcatttatc agggttattg tctcatgagc 4380 
aaacaaatag gggttccgcg cacatttccc 4440 
at tat tat ca tgacattaac ctataaaaat 4500 
cgtttcggtg atgacggtga aaacctctga 4560 
tgtctgtaag cggatgccgg gagcagacaa 4620 
gggtgtcggg gctggcttaa ctatgcggca 4680 
atgcggtgtg aaataccgca cagatgcgta 4740 
ccattcaggc tgcgcaactg ttgggaaggg 4800 
cagctggtiga aagggggatg tgctgcaagg 4860 
cagtcacgac gttgtaaaac gacggccagt 4920 
gtatgttgtg tggaattgtg agcggataac 4980 
attacgaatt cggcgcagca ccatggcctg 5040 
gtaccttctg aggcggaaag aaccagctgt 5100 
ccccaggctc cccagcaggc agaagtatgc 5160 
ggtgtggaaa gtccccaggc tccccagcag 5220 
agtcagcaac catagtcccg cccctaactc 5280 
ccgcccattc tccgccccat ggctgactaa 5340 
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ttttttttat ttatgcagag gccgaggccg cctcggcctc tgagctattc cagaagtagt 5400 
gaggaggctc gaggagcttg g 5421 



<210> 7 

<211> 17732 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Lama2/APPA 
transgene 

<400> 7 

tcgagagtat ctttgtcagc tgtgcctcca acaaaggggt actgttgccc acatagaaag 60 
atctaaacta attaattaat ccctcacccg caaatctttc agtcactaag ttagcacgat 120 
tgttgaacaa gttctccaaa ggagagatac agatgagtgc gtatagggtg gacctggctg 180 
ctgaggagac acctgcatct gactaagaag agccacggtg ttagttgaat ggtgtggagt 240 
agggtggttc tgtgggacag tagaaaatcg agaggcatgt gccgtttagt gaactgatgg 300 
aagctacccc aaacgacaga gattgtcagt caggccaatc cgtttcgagt ttgatgggca 360 
gccggacagt gagacagaca cacctactca gttggaggaa ggatgagaac aatggccagc 420 
agggattgag agaccctgac aggcgcaagg ccctaacaca cacacctacc acctcacttg 480 
acaaagctgc caaagaccaa agacttgttc tccattagaa atgacagctg gcttgacccg 540 
acagcataat aagcagagtg tactctgatt ggagaacttt aatgtgtttc attcagtatt 600 
ataaaaggac agtattacag attttgttgt acactgctgt tacatgtggg gcagtgtgtc 660 
tttaagtagg gtaaagtact ctttaaaaat gggtcctaga tattttttcc tttaactcaa 720 
gtctcttact gtttaaaCga tttttatttt gtttaatatg gaggaaaaag aagcgtaaat 780 
ggacaatata tatttagaga aagatggtta gctgtcagaa aaatatgcaa atcaaaatca 840 
caccaagact gcagcacacc cctgtcagat ggctgtgatc aagaaaataa atgacaatga 900 
gtggtggtga agatgtacta aagggaaaca cacacacaca cacacacaca cacacacaca 960 
cacactggag caaccactgt ggaaatcagt atgaatggtc ctcaaaaacc tgaagataga 1020 
gcggggcgtg gtggcataca cttttattcc cagcactggg gaggcagagg caggtggatc 1080 
tctgagtticc aggccagcct ggtctatagc acaggttcta ggacagccag ggctacacag 1140 
aaaaaccctg ccttgattaa accaaaccaa accaaaccaa accaaaccaa accaaaccaa 1200 
accaaaccaa accaaaccag accaaaccaa aacactgaag atagaacttc agtattccat 1260 
tcct:agat:at atacccaatg gagactaagt cagcaagaca cctgcacagc catgttcact 1320 
actacactgt tcaccacagc caggctgtgg aaccagcctg agtgtccatg ataaatgaat 1380 
ggataggtaa ctttcaaggt aaatggactc tgctgtgtac atgcctcaca ttctgtttat 1440 
tcatttttct ttatgaggtg tccattcagg agtcacatgg tagttctatt ttcagtcttc 1500 
tgaagatact acactggtcc ccacagttta cacttttatc agcagtgaat aagggttcct 1560 
ctatccttac catcatttgt tgtaattttt cttgatgacc ctctttctga cagggatagg 1620 
atgtaatatc agtgtgagga agtacaactt gttttctaag tatttattgg ccccttgcat 1680 
ttcttctttt gaaaactgtc ggttcctgac atctgctcag gtattcattg gatgttgttt 1740 
ctttggtgtt tgagttctta tgaattctag atgttaaatc cctgcctgtg gttctctccc 1800 
attctgtagg ctgcctcctc accctggcaa ttgttgtcct tgttttgcag aaacttttga i860 
cttcatggaa tctcatttgt cagttttccc tcctctgcta tagcctgagc taatgcactg 1920 
gtttttacag agccctggtc tatgccttta tcctcctctg gcagcttcgg agtttcattt 1980 
cttacattta gatctttgat ccactttgaa caagttttgg agcagggtga gagatacgaa 2040 
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tctagttcca ttcttccata tgtgatccta gtttacatag catcgttggt tgaagaggtt 2100 
ttattttatt tttaaataat gtgtcataaa aaacgaggtg gttgtagcag tgtggatttg 2160 
tttctttgtc ctttgatcta caggtcttgt tttgtgtcag tctcatgatg ttttattgct 2220 
atggctctgt catacagtct gaggtcaggt attgtgatat accttcagta ttgctccctc 2280 
agactcaggt ttgctttggc caggagtcat cttactcagt gctcttagag ctcccccagc 2340 
atgtagctgc tactattctt agttgataaa tcaggaaact ggggctcaga gagattaact 2400 
gtcttgaact acttctgggg aggtgaaacg tggagacact aaactgtgtt taccctgtac 2460 
tgctccagta gctgtcgggt gctgggctac agcaaagcac ctatactata tattactcag 2520 
gaggtggaaa aactcagcct cccttggggt tcccaagctc ccaggtgtcc agtcactgct 2580 
ggaaacctca tggagtctga aaggaagggt tgagggtaca tggggcagcg atgaggagcc 2640 
tggggctggg atctcccaaa cacctggata tccagatgcc actgggtcag ggggagttgg 2700 
gaacagagtt gggatgtcca tggacctgtg acaaggccag ggccaggggg aggataacCc 2760 
tggctttact aatttgcgaa agtccttagc ttagcagcag ttgtctggga gcacagaggg 2820 
gccttctgta agaggctcag gcagtgccgc tctgtaggcg aaggtcttct ccatgttccc 2880 
catggtggtt cttgatgaaa gagacagtcc ttggctccaa actggtttat tgattgttca 2940 
ttgtggaaaa tgggtgcaca ccaccttctc agggtggacc agagatcaaa taccttttgc 3000 
agggaggaat atctgggaag ggacgcttac tggctaaacc ctcagggcct ctagatacat 3060 
cattagcatg gagaactctg ttctgggcta catgaccaca ggccacattt ccacaagcca 3120 
catgtgggaa gtgtggcaca tgttctaggc caggaatctg gtagggagcg tggagccacc 3180 
taccatccca ggtgggtgcc tgggtgccag ggaccctgaa cccgctcaac cttaccaagt 3240 
ttcctggcag ggtccactgt cctacacaga agctggagga ggtgtgaggg ttgtgtcttt 3300 
gtggaatgtc ccatgctgct tggggctcag tttctccacc tgtacctcat tggtttgggt 3360 
ataaaaagtg gggatacttt attattctct gactcggtcc tgaggaaaaa gcatcgtggc 3420 
agtccaggaa ccacaccctg aggttcctgc actgaaggga ctccctaagt ctctggagtc 3480 
tctccccttc acagagctgc caaagtctag gttcttttga ggataacaga gccatgcttg 3540 
gtaagcagac aacagcattt gtttactcaa ccttcttttg tcagctccct cttcataaac 3600 
aagttgagac accatgctgg cttgaggaag acttctaaag ccagacaact gtgcaaggaa 3660 
gaagaagaag gggcaagtgg agttagcctg gatgtagccc tcaaagtctc cagagaccag 3720 
ccatgaaggc tcaagtggag ggcaagacct gcagcagcca agcatctggc aggagaggat 3780 
cctgggaacc cctctaccat gacacacatt cttcctgcag gtcacactta ataggccatt 3840 
tcttatttgg atctatcatg gtgttctgtg cgagattaat gaggtgttat gctgcgaaca 3900 
gaaagttata taaiaaacaag tccccccccc ttgtcactgc tgctaagaat gtagcagaaa 3960 
ttgtctcaag tgtctctcta atcagaaaca ataaaggtct ccttggattc aagccctcca 4020 
gtttcctcct tccttgctga gccttggaca cccatacaaa cctcctggat gctacagctc 4080 
tgggcagaga ctccaaggtg gggagagact gatggtacaa aagcaaaata cttgtttggg 4140 
ggtacaccca ctcctctgcc tgtgtggttc ctgcagtcag tcctgcagac aggccctcag 4200 
tgggtcttcc atgggcaaca cgcagaggga ggcaatggat gggaataccc acaccctggt 4260 
tagtttaccc cggccatgct ctctgctctt catccctcct ctgccctctg ccacggcttt 4320 
ctctgcagga atcatatctt catattggcc cacaggtgtt ctcctcaccc tagctatgat 4380 
gtttacttta gagtgacctt agcagggctg gtgggaatga gttctagaag gctcacggag 4440 
atgctaggga agaaacgtct tctaactact gaggttacta agttcctggt ggttgtctct 4500 
gcctttccct tgttaaagtc accttgaagt tagtgcagaa gaaatcagag cccagtcaca 4560 
gagtaaatat ggtcctgaag atttcctttg agtgcccaga atccatgaca tttcaagagc 4620 
cctctttgta ccttaagtca tttggggttg tatcttctgc ttgatgtatg tgtgtgtgtt 4680 
tatcaaagag tgagatggtt acataagagg tgctctaaag gacagagagg atttgcaatt 4740 
gtggcatgtg acatcctcag gccttgctct ggtgccagga ggaactgatg cagaaaagag 4800 
taagaggtca tttcctggag gctgtcacta tagaggagat cttacagtgc attccctcct 4860 
ccaggccctg cctgaggata gacatgtgct gactgcaact gaaacagagg cttgggatgg 4920 
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agagttaggt tcacagaagg gagggtggga gatggatgct tgctgggttc tgggtctcat 4980 
caccagctcc tgaccacccg gtcagcccat gtgcttattc catagctttc ttttgctatg 5040 
tttactcagt gtggtgtttg ttgggaccca gcagaagcca gtcccaggct gacagctgtg 5100 
gatacacagg gcagcatgag ggtcctcagc ctgaagcagt caggctggca gaagagaaag 5160 
accagcacac attccttcaa ccaactatgt cttgaaaaac aaacatatta tatcacatat 5220 
attgcattta tgagacagct aaaatgtact cgggtagcat gactccaggt ggggatatct 5280 
gcaagtgcca tgagtggcag agggacagcc aatgtgaggc aagaaggaat tctggctcaa 5340 
cacagcttag ctccctggtg ttggttcaaa ctttgagagt ttgaccacaa gcactttatt 5400 
tttgacatat ttaaacagag cacaactttg ggaaaaagtt ttcttatgaa aattatcaca 5460 
ataaagctta aggcatgact acattaaaat gcctttgcaa agtatatgtg ccctcttcca 5520 
caagaatggt tctattgact gagaaataat gttcaggata aagatccagg aagaaaagat 5580 
cagggataag taaaatacta aactcttttg caaagtacat agaccctctt tcataacaat 5640 
gggttctatt gactgacaag cactgctcag . gagttgggaa agagtctagc ataagcacga 5700 
tagcctggag actctagtga ggtctagtct tacagacagc aaaaatcacc aggttacaaa 5760 
ctacattcat ttccagtttt ctgatcaggc acaggtatga atcccttctg ttgaagagaa 5820 
aagtccatgt gtttaaaata tctggtttct ccagtgctat tagcgagaag acttgagccc 5880 
tatacaacCc ccacctggag tgacatcctg tcttcatggt atattacata cctagacacg 5940 
ctcatctcac agacttagga ctttgtcttc tgatctccat ttctgatccc acttccacct 6000 
ttgccttgat agtgtcattt tcttcactgc cttggtgaca accatgttat cctctgtgta 6060 
tttgagtgtt accattttca gattttacct gtatgcaaga tcacacagtc tttgtctttc 6120 
tgtctggatg catgctaatc tctacacaac aacccttccc cgtcactcag atcttcctcc 6180 
attaacacat acatggtgct gaagaggcta gggagcttcc cttcagtggg gagctagctg 6240 
gctattgggc ctttttgact gtccaggaag gcccccaatt gctgagacaa gaacttagat 6300 
tcttcattat tgactctaac tcatgtatca agcagaagct aatgaatagt tatcaacagg 6360 
atcagaggtt ccagtgtaag acactttgac atgaaagaac ggaggaagga cagatggatg 6420 
cataaaagca ggaccactgc cccaggaagg tcctggaaac tgatgcaggg caaaggacag 6480 
gttataaacc aaatcttagg gagtcaggaa gagcacagag gagctcaacc aactgaccac 6540 
tgcttagggg ctaccaaccc aatcctccct gtgggaacag ctaagctatc agccaagggt 6600 
aataaacagg caggacctgt ggatgacatg gagagcatag ggaccctggg tccagccttt 6660 
agcacctgca ctctcaggat actccaccat tgtgtcttag agagcctagg gatactgggt 6720 
ccagcctttg gtaccttcac tctcagggta ccccatcact gtgtcttgga gagcctaggc 6780 
accctgggtc cagccttcag tacctgcgct ctcaggacac cccaccattg tctcttgccc 6840 
cgtctcttct tcctcttcct ccctttcatt gtctcttctc tgtttctttc ttgactctcc 6900 
tttcccctca caccctcact ctagttctcc ccttccctct ctgcatcacc ctattctctc 6960 
tgtggtccct ccactttcct ttatctctca tgcttctctc ctccctcaaa tacttgtcac 7020 
ccactatact tcaggggcca gctctagtga caaagctgtt aatagcaaga ctctcagatc 7080 
tccaacggct cagaggagcc agacccacca agaactctct ccaggtccaa tttcaggttc 7140 
cttcgaaagc tttcagcaaa tgctcaggga acatgccact aacaagaaga tgcaaattcc 7200 
agttgagagt gggaaaggcc cttgcgtagg tcccatcttc caggccaagg tcagaggggc 7260 
tctgtgtaat ccggattgac agggctcaga acaatgtttt gtttttaagg tttatttatt 7320 
ttaggtgtta gtgtctttgc ttgcatgacc ttatgtgcat catgtgtgtg caggttcctg 7380 
atgacagtag aggagggctt tgaatccctg gggataggaa gttacaggaa attataagct 7440 
gctttgtggg tcttctagct ttcccaacag aagtgaatgc tcttcaccac tgagccatct 7500 
ctctaggccc aagagacatt gctttatgga tataattgtg tgtgtgtgtc aacattgagg 7560 
aaagggaaat aaaaaaaaaa cttcagccgc taaggttgta cagtttcact aattgctact 7620 
tttagttgtg ataaaatggc aggtgcttca acatttatat atacaaaaac ttccctgctg 7680 
gtggttcaac tgtgagaact ggggtaagtg ggtgagttct ctttttctgt ctctgtctct 7740 
gtctctctcc ttccattctt tcttaaagga aataaacatt gcagctgggt tatagctcat 7800 
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caatatggaa gttacagaag tgaaaaaagg cattgccttg gtgggtggtg ttaccagctg 7860 
atttttggtt gtcctgcaag gaggtctggg gactggctgc tctgtctctg tctgtatgag 7920 
tgagggaagt ctggggagca gattccctaa ccttcagcct ggcctggttc ctgagtgaac 7980 
ccagcctctc tggtcctagt agctttttcc aaacaggaat ctgagtggtg acagggaaca 8040 
agtaccagcc cattgcttaa gtgccagggt tagtgagggc aggaagctgc catagctggg 8100 
attagtagtt gtattggatg taggaagtcc tatcctggga cagctaatcc ttaatgcttc 8160 
actggagatt ttcaatgaga aatttatccc acggcccata tggccccatc cttttgtctc 8220 
caacagccaa gtattttcca ttagaggaga cttcctgtac acttgatgga tgctcattcc 8280 
aaggtgactt ggggcagtca gtacagactt gggatgacct ctgacagcct aacctctccc 8340 
caacaagggc cctctatgtt tgctatgtaa tgtaatgtca gacattgtca ggagtgtccg 8400 
cagcacagcc tgcccagtgt gagggctctc ataggtttcc cactgtctta tctacacagg 8460 
gataacgagg aggtaagctg cagttcccag tctcacttca cagaggaaga gataacccca S520 
tcccaggtca tgtagccagc agtggaaaga atgaggattt gaactcaggt cttccaagtc 8580 
ccattgatag catctcctca caagtccctt gccaccctca cgatgcctta gacacttgcc 8640 
tgccctttat actaaggaga tgcaggtaca aggggtttac ccatgtagca gctgaggcag 8700 
ctggggatag ataccagcag caggcctgat gtcaccactc taactccagc atccccagtc 8760 
tgtgttcctg gagtgtgaaa atccctactt aacaagattg tgcaacagtc cttggctctg 8820 
tgacccatag ctggaaacag gattctcatt gatttgtgga acatggtggc agccagccaa 8880 
aaagagggtc tgcatacaga agacacgtgt ggcaaggcca cagcagactc tgactacctt 8940 
agcttacaga attacaaggt cataatgtcc tctgctttgg tcacctcatg ttaaggacag 9000 
gccctaatga agatggggca gaagactgaa ggaatggcca accaataact ggcccaactt 9060 
gagacccatc ctacaggcaa gcatcaattc ctgacactac taatgatact ctgttatgct 9120 
tgcagacaga agcctagcat aactatcctc cgagaggtcc acccagcaac tgactgaaac 9180 
agaaaaagat atccacaggc aaacagtgga tggaggtcag ggactattat gggagagctg 9240 
tgggaaggat taaaaaccct gaaggggata ggaaccccac aggaagacca acagagtcaa 9300 
ctaagagacc tgtgggagct ctcagagact gagccaccaa ccaaagagca tacacaggcc 9360 
ggtccgaggc acctggcacg tgtgaagcag acatgcagct cagtctccat gtaggtcctc 9420 
caataagcgg tagcctgact gcagtatcca atccccaaca gggctgcata gtctggcctc 9480 
agtgggggag gatgccccta atcctgcaga gacttgatga gtggagagct atccaggggg 9540 
aacccaccct ctctgagaag ggaatgggga tgggggaggg actctgtgaa gaggggacaa 9600 
ggacaaacaa gaacctcaaa taggtcaggc cctaaaggct tgctaagtag cagtggccca 9660 
gctctgtcct gttcctcagc ccaaggctca gctcccacct gtttctgtgt ttttctggct 9720 
tttcatgggc ctaggacttg gtgaccagtt caaacaatgg ggcctgtgga agacacaata 9780 
tacaagacta gggacattcc tgttctgctg actatccata gcctgatgta ggtggaagga 9840 
cccaatcact ggatttctac ccttgcacaa ccttgacagc tgagggcctc tcagaaacct 9900 
atttcttcca ctgaaaaatg agactctcaa atgaacgtcg tgacaatcat caggcttatt 9960 
aaagaggtgt atctaacctg aatggcaagc agacagcagg caaatgtctg tatcaacctc 10020 
taggaaggac aagaactgct cactgctgcc ccccaggagg ccatttgctg aaacagctgc 10080 
tctcctgctg gtgcacaggc cctgccttct cattgcagcc acagcccctt cctgtctgaa 10140 
cctcctgtca ggtcactggg aaacagatca agatggaaca ggacagctcc tgatggtaaa 10200 
taaaaaacag tggtcatggc tattcatagg ggtttatgct tcttcagtcc acactgtgaa 10260 
gagctgtggg catgaaccac agtgttcgag gtagagttgg ggttctgaaa ttcacagtgg 10320 
ggtgagctca gtaaatgtga gctggaggtc actcgtgaga cacacagtcc tgctgcttct 10380 
gttcccaata tcctgaggag acgacacatc tactttgttc agaggccaca gtctagttga 10440 
cctgagagtt accagtttct tatttgtgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg 10500 
tgttgttcgt gtgtgagtgc aggtgcacat atgatagcgt acacgttgag gtcagaggat 10560 
aactatcagg cgttgtcccc tcctactttt cctcggactc tggagaacaa acatgggtcc 10620 
ttattccagg ggagcaagtc gctgttggct gacacatctt gctcacatac attttaccta 10680 
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gacaatggag cctccatcag agtattactt tagctcctca ccgatggcaa tgcaccacct 10740 
ctctacccac ataggagttg ggtctccaca cacccccaca cccccttcac caaaacgttt 10800 
tcagttactt tatctggtaa agttcatcag agaatgaagc cagtattaag aacatggaat 10860 
catttgggaa cctggatcta gcaatacccc accctagatg gagttgctga gttttcacct 10920 
cagattataa ttccccccta gcttctatgg tttattctga aaccagggga actcgattcc 10980 
tccctttgga ccacagacat cctggcttgt gaattcacat gtcatctact gctaatccat 1104 0 
tggtagtatg tggctcacag agacacacta qagtcatggc caatgtcaag gtaggacaga 11100 
tgtgaatcat tcccccagtc ctgctgtttt catgactaac cctcctcagc acagtgacca 11160 
tgaacctact tttcccctcc ttttattttt agaattgctg gaattttcta ttttgagaaa 11220 
taatagcctt gggcagcatt aaacaaaatc atctagaaag ctggtttaaa atacagatgg 11280 
ttgagtcagt gaaagagtga ggaatgtcat tattggcccc tcacagaggc tggctcactc 11340 
cagcagaggt ggttgaagct cttggacacg ggtcaggtgc ataggaaagg tngtctggga 11400 
cactgagaac cacaattgaa caaacagaac tgttggcttt ttttttttta aatgagttct 11460 
caaaaaatga ctggctagct taggcaaata cttcgagcca acccaacaga acattcttcc 11520 
attgattcat tctggatctt ctttctagac aatactgaac tgaccccttg ttggcagtct 11580 
caagtttgac aacatagggc tttgaacttg gcacaaggtc catcactgtc acccaagcat 11640 
cctgggtgac ctttgggttg gaatatcttg gctaacctta gatattttct ttggagtatc 11700 
tttagaacat ccaggaaata gggcttgatt ctcatcctgg gaccacaata taagtcaccc 11760 
tagaatccca ggagatcgtg cagagaaaca aggatctctc tcgtgtgcat ccttcttcaa 11820 
agcagtgagt agtgactcca ctaaactgag ttcccatctg agagtccaca ggaggctttg 11680 
gggcaagaag cagagggaag gcactgtttg tgttggtaaa gttttgactc taacaaattt 11940 
gaagacatag atgacattgt gtcagactaa caacaaccta gactcatgtg ggttctgttt 12000 
agggatcaga ttttattcat caatgacttg tcttagtgta tagagaaagg cttcctactg 12060 
gagtgtaggc tcaataatga cagaagagat agctatttcc cctagggact gtgctgctcc 12120 
aagtttggtg gagaaaggca gtggggaacc tagatgtgct ctctggggag ggggtctgaa 12180 
gctggcttca tagaaggtgt gaagttttgc tgaaacatct aaacagaatt atagcttagg 12240 
aaagtgagca ggcaaggcag ggaatgtgtt gcatatgtat atgtacatga atatattatg 12300 
ttatagatac acacacattt gaacctcatt tgcagatgac agaaaatagg ttattttgcc 12360 
tctcttaact gctaagcaca atgacttcca gttccatcca tttcctgaaa tgccacaatt 12420 
tcatttttca ttgtggctga ataaaiattcc attgcagact gggccctact tcatccactc 12480 
ctgagggcag gcatatcccc tggctccatt tcttacctat tgtgaagaga agtgcaactg 12540 
tcttgttgaa aggcaagcgt gagagaggca ggcactaatt gtgggttttt gtttcttctt 12600 
cctgctatga ctctccattt gtcagetacca aagatcgata aaagccgcca ccatgaaagc 12660 
catcttaatc ccatttttat ctcttctgat tccgttaacc ccgcaatctg cattcgctca 12720 
gagtgagccg gagctgaagc tggaaagtgt ggtgattgtc agtcgtcatg gtgtgcgtgc 12780 
tccaaccaag gccacgcaac tgatgcagga tgtcacccca gaogcatggc caacctggcc 12840 
ggtaaaactg ggttggctga caccgcgcgg tggtgagcta atcgcctatc tcggacatta 12900 
ccaacgccag cgtctggtag ccgacggatt gctggcgaaa aagggctgcc cgcagtctgg 12960 
tcaggtcgcg attattgctg atgtcgacga gcgtacccgt aaaacaggcg aagccttcgc 13020 
cgccgggctg gcacctgact gtgcaataac cgtacatacc caggcagata cgtccagtcc 13080 
cgatccgtta tttaatcctc taaaaactgg cgtttgccaa ctggataacg cgaacgtgac 13140 
tgacgcgatc ctcagcaggg caggagggtc aattgctgac tttaccgggc atcggcaaac 13200 
ggcgtttcgc gaactggaac gggtgcttaa ttttccgcaa tcaaacttgt gccttaaacg 13260 
tgagaaacag gacgaaagct gttcattaac gcaggcatta ccatcggaac tcaaggtgag 13320 
cgccgacaat gtctcattaa ccggtgcggt aagcctcgca tcaatgctga cggagatatt 13380 
tctcctgcaa caagcacagg gaatgccgga gccggggtgg ggaaggatca ccgattcaca 13440 
ccagtggaac accttgctaa gtttgcataa cgcgcaattt tatttgctac aacgcaogcc 13500 
agaggttgcc cgcagccgcg ccaccccgtt attagatttg atcaagacag cgttgacgcc 13560 
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ccatccaccg caaaaacagg cgtatggtgt gacattaccc acttcagtgc tgtttatcgc 13620 
cggacacgat actaatctgg caaatctcgg cggcgcactg gagctcaact ggacgcttcc 13680 
cggtcagccg gataacacgc cgccaggtgg tgaactggtg tttgaacgct ggcgtcggct 13740 
aagcgataac agccagtgga ttcaggtttc gctggtcttc cagactttac agcagatgcg 13800 
tgataaaacg ccgctgtcat taaatacgcc gcccggagag gtgaaactga ccctggcagg 13860 
atgtgaagag cgaaatgcgc agggcatgtg ttcgttggca ggttttacgc aaatcgtgaa 13920 
tgaagcacgc atacccgctt gcagtttgta aggtacccgg ggatcacaac ttgccctctg 13980 
aagaggaaga acagaaggat gccacaactc tcctgctggc tactctccag tggtttcatc 14040 
ttacttctga tggcatttcc ctctagaaag tgctactatc atccacacat ttctacctga 14100 
gaccacccaa aggaccctcc caaattctct tcctctctga gtagtctcca cacctgttac 14160 
caccatccca gaattaaaat cctaactgca ctctggcgtg tgacttgcct cagtccttgc 14220 
aataagagtt gttggcagtg ccaggcgtgg tggcgcacgc ctttaattcc agcacttggg 14280 
aggcagaggc aggcggattt ctgagttcga ggccagcctg gtctacagag tgagttccag 14340 
gacagccagg gctatacaga gaaaccctgt gtcgaaaaac caaaaaaaaa aaaaaaagtt 14400 
gttggcagag tgtgg[gttat ataccaggtg gagatttcaa atgagtggct gaagctgtag 14460 
ccagaaggaa cttagaggat agctcataac ttaaaaagaa atgtagagag tagcagaaac 14520 
attgagagag tgggcacaca gccactgtgt gaatgtggca gaacacaatc cagccagcta 14580 
tacatgcata agtgtatatt ggcgccatcc tgactgatga gacacaggaa aacagataga 14640 
cggggttagg tggccatggc ctttcctgcc tgcctcttcc taagggtcat ctcaagacct 14700 
tatgctctct taactcttcc attgctactt agcttctaga tatcacctcc agattagtct 14760 
ccttgggtac atcagtgatc ctggtgatat ccagggcttc ctgattccat ctttgtcata 14820 
gaggctgcaa ctaaagaggt cttcttaata cttcacaccc tgatgccaaa aggaagacac 14880 
agaagttcac agaggtgaag tgattcatgt aggacataca gtgagcaagc atcagggtcc 14940 
ggattatctg actctactct aacttttatg taaatgtgct ttatgccatt aacactgtca 15000 
ttcctgtgct tcagctctgg gagactccca agcactctta ggcacaagcc acaattaagg 15060 
gactctgaca ctctgcattg attaattagc atggtggtct ctatgtttcc agattcatga 15120 
ttgtttcact ttccatatag gctatgaagg gtgtgaggaa attttttggg gacagaattg 15180 
gaggcaatcc acctctctca ggaagcctct atctggaaaa gcttacaact cagggacagt 15240 
aactgtaggc ccagtccttg gtgtccaaaa tgggttttat ggtttgaatc tgcaaagcct 15300 
tccatgtgct caaaggtttg aacatggagc ctcctcctgg taacactgta ttggaggctt 15360 
ttgagactgg atgctctttg gtcccatgtt ttgctacatc atctgtcaag atatgaccca 15420 
ggcatgctac cagctaccac agactatgcc tctccagctt tcatgttctc cccaccatga 15480 
tagacttgta tctcctaaaa atggaatcaa agcaaacttt tcctgcatta agtttttttt 15540 
tttctgttaa gtgtttggtc acagggacaa gaaaacactc aatacagata attagtacca 15600 
V gagttgaggt tcattgctct agcaagttgg atcaaatttt tagggctttg gaactgattt 15660 

ataagagaca tgtagaagag t:ctgaagctg tgggctacag aagtgtcacc agtttttaag 15720 
aatagtttaa tacaccatgg gaattgtgaa aatcagaatg ctcacacaaa ggcagacagg 15780 
aaaacgtgag catgtggcgt gtgagagggc ataagaagga acctaggggg aaatgagcta 15840 
gaagccattc ggctacgtta gggaacgtgt gtggctgtgc ttggcccatg ccctggcaat 15900 
ctgaatgagg ccaaatttta aaggagtgga ctaactcgat tgtcagagaa aatatcaaga 15960 
cagaccacca ctcaggctat gccgtgtttg tgaccgacca gctactctta gccagctcta 16020 
ttgtgaaatt ccagagcaat tatcagagca tgaagataca tacagtttag tgaagtaagg 16080 
ggtgtgggtc cctaagtgga tggtgcataa atctatgtag gtgatgccta agtgacactt 16140 
gataatccaa aatatcagca atgtggaatg tcttccaagg agacctgtag acacacattt 16200 
tagaactttg ctcatggctg taataaatag ctagctagaa atcatttcct gaagaggtta 16260 
gtctgagtta cggttccagg gcaaacattc agtgatggca aggaaggcat tgcagtcagg 16320 
agccaaaggt cagctggtca cattgcatca agagtagaga gtcagagtgt gagtagaaag 16380 
aggatacagg ttataaaacc tcactgtcca ctctcagcaa tccattttct cctaaaaggc 16440 
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tttaccttct aaagatttta gtcttcaaaa ccagtaccag tagcctggga acaaaagttg 16500 
aaacaaatga gcctttgtgg ggcatttcac acttaaaaca gggcatcacc taggaggagc 16560 
cctgtgtgca gtaggaagtg tggcctctgt gtcaggaatg ctcaggctaa taaggggtcc 16620 
tctatctgag ggaccctatg aagattcaac aagtagttgt gagaattccc tgtaaatgga 16680 
tgctaccaat ttgacatttg tagacctgct attgtgtgct tctttattgg gctctcccat 16740 
ctcccaactt tccaacccat attccacatt aatcccttcc accaccatgc aacactaggt 16800 
aggagagaag gaaggttaga agagaaagtg ggtatagatc tatttagact acttcctgct 16860 
gattaggggc aagtccaatc gtcattgtca ggatacctcc aaccagcaac cagcaaacca 16920 
gcaaatcaga aacagcaaaa gcagccaaca aggcagcact aaccagcagg attggggtcg 16980 
gtagcgtggg agcagtcact actggtcttc tcatggcttt ggcattaata ctctctcaag 17040 
aaattccgta attttttccc caccacctga aattccgtaa ttttaaatgc aaactatcta 17100 
cagctggcaa aaatcacatc tctcctagag cacaagacaa atcatagtta ctggctattt 17160 
gcaatctgaa gcatctcaat atcccacacc tgggattaaa acaaaaacat attcacatca 17220 
cataactgtt ttttttttcc aattttttat taggtatttt ctttatttac atttcaaatg 17280 
ctatcccgaa agtcccctat accctcccac ctccctgctc ccctacacac ccactcccac 17340 
tttttgaccc tggagttccc cggtactggg gcatataaag tttgcaagac caaggggcct 17400 
ctcttcccag tgatggccga ctaagccatc ttctgctaca tatgcagata gagacacgag 17460 
ctctgggggt actagttagt tcatattgtt gttccaccta tagggtcgca gaccccttca 17520 
gctccttggg tactttgtct agctcctcca ctgggggctc tgtgttttat ctaatagatg 17580 
actgtgagca tccacttctg tatttgacag gcactggcct agcgtcacat gagccagcta 17640 
tatcagggtc ctttcagcaa aaccttgctg gcatgtgcaa tagtgtctgc gtttggtggt 17700 
tgattatggg atggatccac tagttctaga gc 17732 
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